Implementation of crawlers and their manager written on python Crawlers can collect all text, structured data and links from the given list of webpages
Install python libraries with
pip install --no-cache-dir -r requirements.txt
cd python-crawler
python main.py "in file" "max depth" "number of threads" "concurrent_tasks" "max_queue_size" "max_cycles" "delay"
To build container:
./build.sh