dhtcrawler is a DHT crawler written in Erlang. It can join a DHT network and crawl many P2P torrents. The program saves all torrent info into a database and provides an HTTP interface for searching torrents by keyword.
- Download MongoDB and start it with text search enabled, e.g.:
    mongod --dbpath db --setParameter textSearchEnabled=true
- Download the dhtcrawler source code.
- Use rebar to download and install all dependent libraries:
    rebar get-deps
- Compile:
    rebar compile
- Start the Erlang shell with the ebin path added:
    erl -pa ebin
- Start dhtcrawler:
    crawler_app:start().
- Start the HTTP front-end:
    crawler_http:start().
- Open a web browser and point it to localhost:8000/index.html
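
The build steps above can be wrapped in a small shell helper. This is only a sketch, not part of dhtcrawler: the function names `check_prereqs` and `build_dhtcrawler` are hypothetical, and it assumes `mongod`, `rebar`, and `erl` are installed on your PATH and that you run it from the dhtcrawler source directory.

```shell
#!/bin/sh
# Hypothetical helper (not shipped with dhtcrawler): report which of the
# given command-line tools are missing from PATH.
check_prereqs() {
    missing=""
    for tool in "$@"; do
        if ! command -v "$tool" >/dev/null 2>&1; then
            missing="$missing $tool"
        fi
    done
    if [ -n "$missing" ]; then
        echo "missing:$missing"
        return 1
    fi
    echo "all tools found"
}

# Hypothetical helper: fetch dependencies and compile, using the same
# rebar commands listed in the steps above. Assumes the current directory
# is the dhtcrawler source tree.
build_dhtcrawler() {
    check_prereqs rebar erl || return 1
    rebar get-deps && rebar compile
}

# Usage (run manually from the source directory):
#   check_prereqs mongod rebar erl && build_dhtcrawler
```

After a successful build, start the Erlang shell and run the two `start()` calls by hand as described in the steps.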

For configuration options, see priv/dhtcrawler.config.
NOTE: when you change the node_count value in dhtcrawler.config, you should delete all files saved in the dhtstate directory.
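
Clearing the saved state can be scripted. The sketch below is an illustration, not part of dhtcrawler: the function name `clear_dht_state` is hypothetical, and it assumes that the state directory is `dhtstate` relative to the current directory unless you pass another path, and that the crawler is stopped before you run it.

```shell
#!/bin/sh
# Hypothetical helper (not shipped with dhtcrawler): delete everything
# inside the DHT state directory but keep the directory itself.
# Assumption: state lives in ./dhtstate unless a path is given as $1.
clear_dht_state() {
    state_dir="${1:-dhtstate}"
    if [ ! -d "$state_dir" ]; then
        echo "no such directory: $state_dir"
        return 1
    fi
    # -mindepth 1 skips the directory itself; -delete removes its contents.
    find "$state_dir" -mindepth 1 -delete
    echo "cleared $state_dir"
}

# Usage (run manually after editing node_count):
#   clear_dht_state dhtstate
```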