EdenWuyifan/LabeledImageCrawler

LabeledImageCrawler

This is a Java-based, multi-threaded web crawler that collects images matching specified keywords.

How to use

Installation

Open and run the project in Eclipse. To install Eclipse, see https://www.eclipse.org/downloads/

Features

[Screenshot: crawler window]
  • Root URL: the URL where the crawl starts (must begin with http:// or https://)
  • Keywords: the labels to crawl for; separate multiple labels with commas (e.g. cat,dog)
  • Max depth: the maximum depth to which the crawl tree may grow
  • Start Crawl: start a new crawl job
  • Stop Crawl: interrupt the current crawl
  • View DB: open the DB view (see below)
[Screenshot: DB view]
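
To illustrate how the Root URL, Keywords, and Max depth fields might be interpreted, here is a minimal sketch. It is not taken from the project's source: the class `CrawlSketch` and its methods are hypothetical, the crawl is single-threaded for brevity, and an in-memory `links` map stands in for fetching pages over the network. It also assumes the root page sits at depth 0, which the project may define differently.

```java
import java.util.ArrayList;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Set;

// Hypothetical sketch; none of these names come from the LabeledImageCrawler source.
class CrawlSketch {

    // Accept only roots that begin with http:// or https://, as the Root URL field expects.
    static boolean isValidRoot(String url) {
        if (url == null) {
            return false;
        }
        String u = url.trim().toLowerCase(Locale.ROOT);
        return u.startsWith("http://") || u.startsWith("https://");
    }

    // Split a comma-separated label string such as "cat,dog" into individual keywords,
    // trimming whitespace and dropping empty entries.
    static List<String> parseKeywords(String raw) {
        List<String> keywords = new ArrayList<>();
        for (String part : raw.split(",")) {
            String keyword = part.trim();
            if (!keyword.isEmpty()) {
                keywords.add(keyword);
            }
        }
        return keywords;
    }

    // Breadth-first traversal that stops expanding once maxDepth is reached.
    // Assumption: the root is at depth 0, so maxDepth = 1 visits the root and its direct links.
    // The links map is a stand-in for downloading a page and extracting its outgoing links.
    static List<String> crawl(String root, int maxDepth, Map<String, List<String>> links) {
        Set<String> visited = new LinkedHashSet<>();
        visited.add(root);
        List<String> currentLevel = List.of(root);
        for (int depth = 0; depth < maxDepth && !currentLevel.isEmpty(); depth++) {
            List<String> nextLevel = new ArrayList<>();
            for (String page : currentLevel) {
                for (String link : links.getOrDefault(page, List.of())) {
                    // Set.add returns false for pages already seen, which prevents revisits.
                    if (visited.add(link)) {
                        nextLevel.add(link);
                    }
                }
            }
            currentLevel = nextLevel;
        }
        return new ArrayList<>(visited);
    }

    public static void main(String[] args) {
        Map<String, List<String>> links = Map.of(
                "https://example.com/", List.of("https://example.com/a", "https://example.com/b"),
                "https://example.com/a", List.of("https://example.com/c"));
        System.out.println(isValidRoot("https://example.com/"));
        System.out.println(parseKeywords("cat, dog"));
        System.out.println(crawl("https://example.com/", 1, links));
    }
}
```

A multi-threaded variant could hand each level's pages to a thread pool and share a concurrent visited set, but the depth bookkeeping would stay the same.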

Live Demo

Link to DEMO: https://www.youtube.com/watch?v=TC86LboiYoU
