Skip to content
This repository has been archived by the owner on Jan 22, 2020. It is now read-only.
/ dhtcrawler Public archive

dhtcrawler is a DHT crawler written in erlang. It can join a DHT network and crawl many P2P torrents.

Notifications You must be signed in to change notification settings

kevinlynx/dhtcrawler

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

26 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

dhtcrawler

dhtcrawler is a DHT crawler written in erlang. It can join a DHT network and crawl many P2P torrents. The program save all torrent info into database and provide an http interface to search a torrent by a keyword.

screenshot

Usage

  • Download mongodb and start it with text search, i.e:

      mongod --dbpath db --setParameter textSearchEnabled=true
    
  • Download dhtcrawler source code

  • Use rebar to download and install all dependent libraries

      rebar get-deps
    
  • compile

      rebar compile
    
  • starting the erlang shell with the ebin path added

      erl -pa ebin
    
  • start dhtcrawler

      crawler_app:start()
    
  • start the http front-end

      crawler_http:start().
    
  • Open a web browser and point to localhost:8000/index.html

Config

see priv/dhtcrawler.config.

NOTE, when you change node_count value in dhtcrawler.config, you'd better delete all files saved in dhtstate directory.

About

dhtcrawler is a DHT crawler written in erlang. It can join a DHT network and crawl many P2P torrents.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published