Skip to content

Latest commit

 

History

History
45 lines (34 loc) · 1.61 KB

README.md

File metadata and controls

45 lines (34 loc) · 1.61 KB

400 trash bags of grocery receipts + Neo4j

Help loading data and sample queries from the meetup talk.

Check out the slides here.

Running neo4j with the Instacart data

Prerequisites

To run this locally, you'll need

  • a local docker environment
  • the instacart data (in tar.gz format) downloaded and moved to the root of this project directory

You can download the data here, and must agree to the terms and conditions. For more information about the contents of the files, see this data dictionary.

If you don't want to use docker, you can also run neo4j in other ways, but for the ease of setup, I've chosen to use docker. The biggest thing you'll have to do manually if you don't use docker is move the raw data to the equivalent of /var/lib/neo4j/import for the method you're running neo4j.

Prepping the data

There's a script, prep-data.sh, that will untar the data and then scrub a CSV that has improperly escaped doublequotes. Once you've run that script, you should be able to run the docker container and start executing the import queries.

All in all that should look like:

./prep-data.sh
docker-compose up

Now check out the cypher directory of this project for queries to import the data and all the queries I use in the presentation.