This is web scraper that powers cfpcalendar.com.
Data is collected from various sources and written to Airtable.
Things you'll need:
- Airtable account
- Twitter account
- Python 3 + virtualenv
-
Once you've created an Airtable account, use this link to copy the template Base to your own workspace.
-
From https://airtable.com/account generate your API key and make a note of this. This will be your
AIRTABLE_API_KEY
. -
Go to https://airtable.com/api and select your base (the one to which you copied the source one in step 1 above). The URL you go to will look like
https://airtable.com/appXXXXXXYYYYY/api/docs
- make a note of theappXXXXXXYYYYY
. This will be yourAIRTABLE_BASE_KEY
Create yourself API keys from https://developer.twitter.com/en/apps.
The easiest way to run this is in isolation, using virtualenv.
-
Clone the git repo
git clone [email protected]:coderanger/cfp-scraper.git
-
Create virtualenv
cd cfp-scraper virtualenv --python=python3 . source ./bin/activate.fish
(Use the
activate
script appropriate for your shell) -
Install required modules
pip install -r requirements.txt
-
Activate the virtualenv
source ./bin/activate.fish
(Use the
activate
script appropriate for your shell) -
Based on the credentials obtained above, run:
export TWITTER_CONSUMER_KEY=xxxxxx export TWITTER_CONSUMER_SECRET=xxxxxx export TWITTER_ACCESS_KEY=xxxxxx export TWITTER_ACCESS_SECRET=xxxxxx export AIRTABLE_API_KEY=xxxxxx export AIRTABLE_BASE_KEY=xxxxxx
-
Launch:
python main.py