The scraper iterates over a subaccount (or over a CSV called course_IDs.csv), checks each course's 'Syllabus' tab, downloads every .pdf file found there, and saves each one into the pdf folder, renamed by course code and year.
dl_data.csv records which files were downloaded for each course.
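If you want to inspect the download log programmatically, a minimal sketch like the following works with the standard library. The exact column layout of dl_data.csv is not specified here, so the helper makes no assumptions about it and simply returns the raw rows:

```python
import csv

def load_download_log(path="dl_data.csv"):
    """Read the scraper's download log into a list of rows.

    The column names are whatever the scraper wrote; this helper does
    not assume any particular schema, it just parses the CSV.
    """
    with open(path, newline="", encoding="utf-8") as f:
        return list(csv.reader(f))
```

You could then filter the returned rows for a particular course code, or count how many PDFs were fetched per course.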
- If you do not have Python, install it. If you are new to Python, I recommend installing it via https://www.anaconda.com/download/.
- Clone this GitHub repository.
- Install the dependencies (first-time setup only): run `pip install -r requirements.txt` from a command shell in the directory of your cloned repo.
- Run the script. It will prompt you for the following:
  - Token (Canvas API token)
  - Subaccount to run in
  - Term to search through
Please note that this script is rather slow: to avoid the risk of overloading the AWS server, all API calls are made on a single thread. This script will be rewritten soon.
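To illustrate the single-threaded approach described above, here is a minimal sketch of sequential Canvas API pagination. This is not the scraper's actual code; it assumes a generic paginated endpoint and uses the `requests` library (Canvas paginates responses via `Link` headers, which `requests` exposes as `resp.links`):

```python
import requests

def get_paginated(url, token, params=None):
    """Fetch every page of a Canvas API endpoint sequentially.

    Each request waits for the previous one to finish, so the server
    only ever sees one in-flight call from this client.
    """
    headers = {"Authorization": f"Bearer {token}"}
    results = []
    while url:
        resp = requests.get(url, headers=headers, params=params)
        resp.raise_for_status()
        results.extend(resp.json())
        # Follow the rel="next" link header until no pages remain.
        url = resp.links.get("next", {}).get("url")
        params = None  # the next-page URL already embeds the query params
    return results
```

Because each page is fetched only after the previous one returns, total runtime grows linearly with the number of courses, which is why large subaccounts take a while.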