Tool is adapted for www.work.ua (soon will be configured for other Ukrainian job search websites).
All CVs with PRO label (from pages https://www.work.ua/resumes/?page={1..n}
) will be downloaded to "path" directory in parse_resume
method. To increase/decrease number of pages for crawling, change range in parse
method.
Install Scrapy and its dependencies from PyPI with:
pip install Scrapy
Note: sometimes this may require solving compilation issues for some Scrapy dependencies depending on your operating system, so be sure to check the Platform specific installation notes.
For more details: Scrapy installation guide
To start crawler:
cd resume_spider
scrapy crawl work_spider
For more details: Spiders — Scrapy 2.7.1 documentation