Documents scraped from Open Directories #11

upintheairsheep · 2023-02-13T17:54:18Z

https://odcrawler.xyz/
You have the ability to search by document type, you should get all the html, htm, pdf, doc, docx, json, xls, xlsx, java, xml, js, css, py, ppt, pptx, txt, csv, md, odf, vtt, srt, and tex files in the Collection and scrape each and every one of them, including the directory listings themselves, and make sure to integrate their names. Make sure to exclude certain directories that are datasets themselves, which will be added separately.

upintheairsheep mentioned this issue Feb 14, 2023

Internet Archive #13

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Documents scraped from Open Directories #11

Documents scraped from Open Directories #11

upintheairsheep commented Feb 13, 2023 •

edited

Loading

Documents scraped from Open Directories #11

Documents scraped from Open Directories #11

Comments

upintheairsheep commented Feb 13, 2023 • edited Loading

upintheairsheep commented Feb 13, 2023 •

edited

Loading