Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Adding athena_example #120

Open
wants to merge 19 commits into
base: main
Choose a base branch
from
6 changes: 6 additions & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -90,6 +90,12 @@ See `pgsql_big_dedupe_example/README.md` for details

This is the same example as the MySQL IL campaign contributions dataset above, but ported to run on PostgreSQL.

### Athena example - IL campaign contributions
See `athena_example/README.md` for details

This is the same example as the MySQL IL campaign contributions dataset above, but ported to run on Athena.



## Training

Expand Down
19 changes: 19 additions & 0 deletions athena_example/README.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,19 @@
# Athena Example

Takes a database of IL campaign contribution data, loads it in to a
Athena database, and identifies the unique donors.

To follow this example you need to

* Update `athena_example/config.py` with your Athena credentials, database name and the path to sroe the data
* Install dependencies, `pip install -r requirements.txt`

Once that's all done you can run the example:

```bash
cd athena_example
python athena_init.py
python athena_example.py
```

(use 'y', 'n' and 'u' keys to flag duplicates for active learning, 'f' when you are finished)
Loading