webDataExtractor explores all the web pages of a specific web site and extract from them: emails, Usernames, HTML comments, Telephone number and Links.
python webDataExtractor.py https://www.mySite.htb 1
python webDataExtractor.py <target url> [<level>]
Parameter | Description | Example |
---|---|---|
target url |
URL from which we pretend to extract data (emails, username, etc) | https://www.mySite.htb |
level |
level at which to stop scanning pages. If not specified the entire site will be scanned | 1 |
It's very simple
cd /opt
pip install beautifulsoup4
git clone https://github.com/dokDork/webDataExtractor.git