From 8e76527417000e5d139cc972f579f9ccb6d99d76 Mon Sep 17 00:00:00 2001
From: Josh <30379833+josh-chamberlain@users.noreply.github.com>
Date: Mon, 3 Jun 2024 11:44:37 -0400
Subject: [PATCH] Update README.md

---
 README.md | 6 +++---
 1 file changed, 3 insertions(+), 3 deletions(-)

diff --git a/README.md b/README.md
index 6a8947a..57ffe15 100644
--- a/README.md
+++ b/README.md
@@ -1,6 +1,6 @@
 # automatic-archives
-These Python scripts extract a series of URLs and their metadata from a JSON file, then caches them at the Internet Archive based on their update frequency.
+This tool uses the [PDAP API](https://docs.pdap.io/api/endpoints/data-sources-database) to fetch data sources, then uses the [Save Page Now API](https://docs.google.com/document/d/1Nsv52MvSjbLb2PCpHlat0gkzw0EvtSgpKHu4mk0MnrA/edit#heading=h.1gmodju1d6p0) at the Internet Archive based on their update frequency.
 
-They are set up to run with a GitHub Actions workflow.
+Then, it uses the PDAP API to update the Data Sources' `last_cached` and `url_status` properties.
 
-The JSON URL data comes from [this repository](https://github.com/Police-Data-Accessibility-Project/data-sources-mirror).
+The script is set up to run with a GitHub Actions workflow.
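
The new README text says sources are archived "based on their update frequency" but does not spell out the rule. Below is a minimal sketch of one way such a check could work; it is not taken from the repository. The frequency labels, the intervals, the 30-day fallback, and the `update_frequency` field name are all assumptions made for illustration. Only `last_cached` appears in the patch itself.

```python
from datetime import datetime, timedelta, timezone
from typing import Optional

# Hypothetical mapping from an update-frequency label to a re-archive interval.
# These labels and durations are illustrative assumptions, not PDAP API values.
ARCHIVE_INTERVALS = {
    "daily": timedelta(days=1),
    "weekly": timedelta(weeks=1),
    "monthly": timedelta(days=30),
    "annually": timedelta(days=365),
}

# Assumed fallback for labels that are missing from the mapping above.
DEFAULT_INTERVAL = timedelta(days=30)


def is_due(last_cached: Optional[datetime], update_frequency: str, now: datetime) -> bool:
    """Return True when a data source should be archived again."""
    if last_cached is None:
        # Never archived before, so it is always due.
        return True
    interval = ARCHIVE_INTERVALS.get(update_frequency.lower(), DEFAULT_INTERVAL)
    return now - last_cached >= interval


def sources_due(sources: list, now: datetime) -> list:
    """Filter source records (dicts with assumed keys) down to those that are due."""
    return [
        s for s in sources
        if is_due(s.get("last_cached"), s.get("update_frequency", ""), now)
    ]
```

Under these assumptions, a script run by the GitHub Actions workflow would call `sources_due` on the records fetched from the PDAP API, submit each remaining URL to Save Page Now, and then write the new `last_cached` and `url_status` values back.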