-
Notifications
You must be signed in to change notification settings - Fork 26
Issues: medialab/minet
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Command to add jobs to a crawler's queue
enhancement
New feature or request
#982
opened Jul 24, 2024 by
Yomguithereal
When -c is not specified, we should default to test all available browsers instead of only firefox
dx
enhancement
New feature or request
#975
opened May 31, 2024 by
Yomguithereal
Scrapping 1000's of comments on Instagram
bug
Something isn't working
question
Further information is requested
#968
opened May 13, 2024 by
Geminy3
Retrieve videos from instagram hashtag function
bug
Something isn't working
enhancement
New feature or request
#967
opened Apr 25, 2024 by
Tyrannas
ThreadsafeBrowser enhancements
enhancement
New feature or request
#963
opened Apr 16, 2024 by
Yomguithereal
3 of 4 tasks
There should be a Crawler side global callback for each job
enhancement
New feature or request
#954
opened Apr 12, 2024 by
Yomguithereal
Add some crawler level job filter
enhancement
New feature or request
#950
opened Apr 3, 2024 by
Yomguithereal
path column is not very useful when using --glob file on scrape/extract
bug
Something isn't working
#939
opened Feb 21, 2024 by
Yomguithereal
Try to scrape twitter embed for tweets rather than using the Guest scraper
enhancement
New feature or request
investigation
#933
opened Jan 18, 2024 by
Yomguithereal
Add --raw flag to yt captions
enhancement
New feature or request
#929
opened Jan 15, 2024 by
Yomguithereal
Possible issue with jobs duplication if crawler crashes somewhere around the processing?
bug
Something isn't working
#928
opened Jan 12, 2024 by
Yomguithereal
The crawler might ask a spider to process an already visited url even if the end_url is already in cache
bug
Something isn't working
#925
opened Dec 19, 2023 by
Yomguithereal
Scrape command -m, -e and --field
enhancement
New feature or request
#922
opened Dec 11, 2023 by
Yomguithereal
4 of 5 tasks
Add extraction method to scrape command related to social network account displayed on homepages
enhancement
New feature or request
#918
opened Nov 29, 2023 by
Yomguithereal
Add method to response to return stripped body and/or add middleware to edit response body on the flight
enhancement
New feature or request
#914
opened Nov 28, 2023 by
Yomguithereal
Add some diagram for the crawler's lifecyle and architecture
documentation
#911
opened Nov 21, 2023 by
Yomguithereal
Integrate a file writer to the http executor like the crawler? also integrate the folder strategy?
enhancement
New feature or request
refactor
#906
opened Nov 9, 2023 by
Yomguithereal
Previous Next
ProTip!
no:milestone will show everything without a milestone.