Skip to content

santoshmo/PGM-Project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

PGM-Project

PGM Project

Contact "[email protected]" with any problems.

#Crawler Requires AlchemyAPI and PyGoogle. To install AlchemyAPI, use the following. Note, "API_KEY" should be replaced with "api_key.txt" from the Crawler folder.

git clone https://github.com/AlchemyAPI/alchemyapi_python.git
cd alchemyapi_python
python alchemyapi.py YOUR_API_KEY
python example.py

To install pygoogle (the relevant files are provided in Crawler/), simply run:

python setup.py install

Run

python genData.py 

Enter your search term. The resulting output should be the top N number of links from Google, named "Query0.txt", "Query1.txt",...,"QueryN.txt". These contained cleaned HTML, without tags, ads, or other nonsensical data.

#Algorithms Use algorithms in Algorithms to run tests and benchmark various queries

About

PGM Project

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published