Skip to content
This repository has been archived by the owner on Nov 13, 2021. It is now read-only.

Latest commit

 

History

History
30 lines (22 loc) · 786 Bytes

README.md

File metadata and controls

30 lines (22 loc) · 786 Bytes

Rugby data scraper

This repository provides Scrapy spiders to crawl rugby match/team/players data from the espnscrum.com

Installation and usage

$ pip install -r requirements
$ cd scraper
$ scrapy crawl espn

The scraper stores scraped data into a SQLite database in /tmp/

Available data

  • Matches
    • Infos (teams, ground, date)
    • Basic statistics (tries, conversions, penalties, drops)
    • Extended statistics (kicks, mauls, etc.)
  • Players
    • Basic info (name, birth date, height, weight)
    • Basic statistics per match (tries, conversions, penalties, drops)
    • Extended statistics (meters run, tackles, etc.)
  • Game events (match, team, player, time, type)

Authors

  • Jean-Eudes Peloye
  • Antoine Redier