webDataExtractor

Description

webDataExtractor crawls all the pages of a given website and extracts from them: emails, usernames, HTML comments, telephone numbers, and links.
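To give a rough idea of what this kind of extraction involves for a single page, here is a minimal sketch. It is not the tool's actual code: the names `PageDataParser` and `extract_page_data` are hypothetical, the email pattern is deliberately simple, and it uses only the Python standard library rather than beautifulsoup4 so that it runs without extra packages.

```python
import re
from html.parser import HTMLParser
from urllib.parse import urljoin

# Deliberately simple pattern (an assumption); real email matching needs more care.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")


class PageDataParser(HTMLParser):
    """Hypothetical helper: collects links and HTML comments from one page."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = set()
        self.comments = []

    def handle_starttag(self, tag, attrs):
        # Resolve every <a href="..."> against the URL of the page itself.
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.add(urljoin(self.base_url, href))

    def handle_comment(self, data):
        # Called by HTMLParser for each <!-- ... --> block.
        self.comments.append(data.strip())


def extract_page_data(html, base_url):
    """Return the emails, comments, and absolute links found in one HTML page."""
    parser = PageDataParser(base_url)
    parser.feed(html)
    parser.close()
    return {
        "emails": sorted(set(EMAIL_RE.findall(html))),
        "comments": parser.comments,
        "links": sorted(parser.links),
    }
```

Calling `extract_page_data(html, url)` on each fetched page and merging the results would approximate the kind of report described above; usernames and telephone numbers would need their own patterns, which are left out here.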

Example Usage

python webDataExtractor.py https://www.mySite.htb 1

and this is a possible result:

Command-line parameters

python webDataExtractor.py <target url> [<level>]
| Parameter | Description | Example |
|---|---|---|
| target url | URL from which to extract data (emails, usernames, etc.) | https://www.mySite.htb |
| level | Level at which to stop scanning pages. If not specified, the entire site is scanned. | 1 |
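The two parameters can be illustrated with a short sketch of argument parsing and a level-limited crawl. This is an illustration under stated assumptions, not the tool's implementation: `parse_args`, `crawl`, and `get_links` are hypothetical names, and the sketch assumes that level 0 means the start page only and that each followed link adds one level. How webDataExtractor itself counts levels is not documented here.

```python
import argparse
from collections import deque


def parse_args(argv=None):
    """Parse '<target url> [<level>]' (hypothetical helper)."""
    parser = argparse.ArgumentParser(prog="webDataExtractor.py")
    parser.add_argument("target_url", help="URL to start scanning from")
    # nargs="?" makes the positional optional; None means "no level limit".
    parser.add_argument("level", nargs="?", type=int, default=None)
    return parser.parse_args(argv)


def crawl(start_url, get_links, level=None):
    """Breadth-first walk; get_links(url) returns the URLs linked from url.

    Assumption: the start page is level 0 and each link followed adds one level.
    """
    seen = {start_url}
    queue = deque([(start_url, 0)])
    visited = []
    while queue:
        url, depth = queue.popleft()
        visited.append(url)
        if level is not None and depth >= level:
            continue  # at the limit: record the page but do not follow its links
        for link in get_links(url):
            if link not in seen:
                seen.add(link)
                queue.append((link, depth + 1))
    return visited
```

Passing the link-fetching step in as `get_links` keeps the traversal logic separate from network access, so it can be tried against an in-memory dictionary of pages before pointing it at a real site.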

How to install it on Kali Linux (or another Debian-based distribution)

Installation takes three commands:

cd /opt
pip install beautifulsoup4
git clone https://github.com/dokDork/webDataExtractor.git
