Skip to content
/ browler Public

Selenium based web crawler. Easily crawl and scrape Javascript heavy websites.

Notifications You must be signed in to change notification settings

xinbin/browler

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Selenium Based Web Crawler

ReadTheDocs - http://browler.readthedocs.org/en/latest/

Requirements

  • redis brew install redis

Install

pip install browler

Uninstall

pip uninstall browler

Configuration

With Selenium Hub

config = {
            "browser": 'remote',
            "remote": {
                "url": 'http://localhost:49044/wd/hub',
                'browser': 'firefox'
            },
            "url": "https://en.wikipedia.org/wiki/Main_Page",
            "limit": 10,
            "processes": 2
         }

Local Firefox

config = {
            "browser": 'firefox',
            "url": "https://en.wikipedia.org/wiki/Main_Page",
            "limit": 10,
            "processes": 2
         }

About

Selenium based web crawler. Easily crawl and scrape Javascript heavy websites.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published