Slides, notebooks and a down-sampled dataset from the QLD AI NLP Fundamentals workshop (21.11.19).
Clone the repo and install the dependencies via:
pip install -r requirements.txt
Then install the spaCy medium English model via:
python -m spacy download en_core_web_md
trove_scrape
- Pulling content from the Trove archives via the Trove API. An API key is required, which means creating a Trove account (see website).
spacy_analysis
- Preprocessing, processing, formatting and analysis of Trove documents.
spacy_vis_benchmarking
- Visualising spaCy dependency parses and NER outputs, plus some very rough benchmarking of multi-core processing and of simple vs. complex spaCy pipelines.
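The multi-core benchmarking mentioned above can be sketched roughly as follows. This is a minimal illustration and not the notebook's actual code: the helper names `time_pipe` and `compare_process_counts` are hypothetical, the default batch size is arbitrary, and it assumes a spaCy version whose `nlp.pipe` accepts `n_process` (2.2.2 or later).

```python
import time


def time_pipe(nlp, texts, n_process=1, batch_size=50):
    """Time one pass of nlp.pipe over texts; return (seconds, docs_processed).

    `nlp` is assumed to be a loaded spaCy pipeline (hypothetical helper).
    """
    start = time.perf_counter()
    # nlp.pipe is lazy, so the generator has to be consumed for work to happen.
    n_docs = sum(
        1 for _ in nlp.pipe(texts, n_process=n_process, batch_size=batch_size)
    )
    elapsed = time.perf_counter() - start
    return elapsed, n_docs


def compare_process_counts(nlp, texts, process_counts=(1, 2, 4)):
    """Return {n_process: seconds} for each setting in process_counts."""
    return {n: time_pipe(nlp, texts, n_process=n)[0] for n in process_counts}
```

With the medium model loaded via `spacy.load("en_core_web_md")`, calling `compare_process_counts(nlp, texts)` gives a rough per-setting timing. A simpler pipeline for comparison can be approximated with `spacy.load("en_core_web_md", disable=["parser", "ner"])`. On small inputs the cost of starting worker processes can outweigh any gain, so the numbers are only indicative.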