Last few examples are taking long time to scrape #3

mipo57 · 2021-11-24T11:57:19Z

Currently, every thread has its own tor instance. Some instances are not working well, so they are changed until one is found. Usually, if request pool is not large (<10k urls), few instances do not find good tor route in time but will take tasks from the queue.

Currently, on the end of processing, we are killing processes with good tor routes (because there is nothing to do for them) and leave bad instances (because they are struggling to find route while they have taken the task from queue). We should probably redesign the code, to separate tor pools from task queue, so that we always pritoritize processing with good instances over processing with bad tor instances

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Last few examples are taking long time to scrape #3

Last few examples are taking long time to scrape #3

mipo57 commented Nov 24, 2021

Last few examples are taking long time to scrape #3

Last few examples are taking long time to scrape #3

Comments

mipo57 commented Nov 24, 2021