Skip to content

Script for retrieving updated MARC authority records from external sources.

License

Notifications You must be signed in to change notification settings

dainst/marc_authority_harvester

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

62 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

marc_authority_harvester

Script for retrieving updated MARC authority records from external sources.

Prerequisites

For Python 3 dependencies see: requirements.txt. You may need to install additional dependencies for lxml, see the official Documentation.

Running the script

Run python3 main.py -h for instructions.

  • There are two supported output formats (MARC and MARCXML).
  • There are three sources for authority data supported, namely the iDAI.gazetteer, the Library of Congress and the iDAI.thesauri.
  • There are three ways to specify the date marking the oldest updates you are interested in:
    1. Specify a date in ISO format (YYYY-MM-DD).
    2. Specify a day(s) offset from your current date.
    3. Continue from the last day the script was executed (after each run, this date is stored in a log file in the output directory).
  • Alternatively, you can run the script without specifying a start date, creating a complete data dump.

About

Script for retrieving updated MARC authority records from external sources.

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages