An example of automated news archiving on a schedule. archive-example.py
contains the scraper, which takes articles from an author page and saves them to the Wayback Machine. Scheduling is handled by GitHub Actions.
For those interested in learning more about archiving.
U.S. government 3-2-1 rule documentation
Real-time cloud storage sync solutions for Box, Dropbox, and Google Drive
Motherboard's mass_archive
tool
IIPC's list of archiving tools
Boss and Broussard, 2017: Challenges of archiving and preserving born-digital news applications
Broussard, Archiving Data Journalism
The Commons: Activists Guide to Archiving Video
Wayback Machine General Information
Donate to the Internet Archive
Scrapy
, a Python web scraping framework