NHagar/NICAR-21

NICAR-21

An example of automated news archiving on a schedule. The scraper in archive-example.py takes the articles listed on an author page and saves them to the Wayback Machine. GitHub Actions handles the scheduling.
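As a rough illustration of that flow, here is a minimal sketch using only the Python standard library. It is not the contents of archive-example.py: the helper names (`LinkCollector`, `extract_article_urls`, `archive_all`), the `path_prefix` filter, and the user-agent string are all assumptions made for this example. The only external detail relied on is the Wayback Machine's public "Save Page Now" URL pattern, `https://web.archive.org/save/<url>`.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin
from urllib.request import Request, urlopen

# Wayback Machine "Save Page Now" prefix: requesting this prefix followed by
# a page URL asks the Wayback Machine to capture that page.
SAVE_ENDPOINT = "https://web.archive.org/save/"


class LinkCollector(HTMLParser):
    """Hypothetical helper: collects the href of every <a> tag it sees."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.hrefs.append(href)


def extract_article_urls(html, base_url, path_prefix):
    """Return absolute, de-duplicated links under path_prefix, in page order.

    path_prefix (e.g. "/articles/") is an assumed way to tell article links
    apart from navigation links; a real site may need a different rule.
    """
    parser = LinkCollector()
    parser.feed(html)
    wanted = urljoin(base_url, path_prefix)
    seen, urls = set(), []
    for href in parser.hrefs:
        url = urljoin(base_url, href)
        if url.startswith(wanted) and url not in seen:
            seen.add(url)
            urls.append(url)
    return urls


def build_save_url(article_url):
    """Build the Save Page Now request URL for one article."""
    return SAVE_ENDPOINT + article_url


def http_get(url):
    """Default fetcher: issue a GET request and return the HTTP status code."""
    request = Request(url, headers={"User-Agent": "archive-sketch/0.1"})
    with urlopen(request, timeout=60) as response:
        return response.status


def archive_all(urls, fetch=http_get):
    """Request a capture for each URL; map each URL to a status or an error."""
    results = {}
    for url in urls:
        try:
            results[url] = fetch(build_save_url(url))
        except OSError as exc:  # URLError and HTTPError both subclass OSError
            results[url] = f"error: {exc}"
    return results
```

The `fetch` parameter is injectable so the logic can be exercised without network access. A scheduled run would first download the author page, then pass its HTML through `extract_article_urls` and the result to `archive_all`. In GitHub Actions, a workflow with a `schedule` trigger and a cron expression could invoke such a script at a fixed interval.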

Additional Resources

Further reading for those interested in learning more about archiving.

Data Storage

U.S. government 3-2-1 rule documentation

Real-time cloud storage sync solutions for Box, Dropbox, and Google Drive

Motherboard's mass_archive tool

IIPC's list of archiving tools

Storing non-text data

Boss and Broussard, 2017: Challenges of archiving and preserving born-digital news applications

Broussard, Archiving Data Journalism

Preserve This Podcast

The Commons: Activists Guide to Archiving Video

The Internet Archive

Wayback Machine General Information

The WARC file format

Donate to the Internet Archive

Technical Tools

GitHub Actions documentation

Scrapy, a Python web scraping framework
