-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
CEUR-WS metadata graph/tree procurement #22
Comments
Using py-3rdparty-mediawiki library code it should be possible to create pages for each volume in a systematic/way. For copyright reasons we'll have to start the trial from the most current volumes and work our way backwards. We'll start with a few dozen pages. CC0 is available for around Volume 15xx up |
A Jinja 2 template could describe the page |
Output should be like https://confident.dbis.rwth-aachen.de/ceur-ws/index.php?title=Vol-2801 taken from the html as shown in the talk page ... |
https://github.com/ailabitmo/sempubchallenge2014-task1 shows a solution done a few years ago which is IMHO way too complex but might have useful bits and pieces of code and background information. |
Input:
see e.g. https://github.com/WolfgangFahl/ProceedingsTitleParser/blob/master/ptp/ceurws.py
The text was updated successfully, but these errors were encountered: