Human Cell Atlas bundle importer for Neo4j

Scripts for loading a bundle from ingest or the HCA datastore into Neo4j.

Install the packages

pip install -r requirements.txt

Start a Neo4j docker instance

docker run --rm -e NEO4J_AUTH=none -e NEO4J_apoc_import_file_enabled=true -p 7474:7474 -v $PWD/plugins:/plugins -v $PWD:/import -p 7687:7687 neo4j:3.3.3

Loading a bundle from the datastore

Provide the bundle UUID and the environment (dev, integration or staging)

./bundle_to_neo.sh -b f804f372-6d3e-46d6-ba00-cec3c75122c7 -n integration

Creating a bundle from ingest

You'll need access to the latest hca-ingest libraries. If you've run this before do pip uninstall hca-ingest to remove the old ingest client. Then checkout the ingest library from https://github.com/HumanCellAtlas/ingest-client. To install the latest ingest client do pip install -e <path to ingest-client>

Run the following command. -e is the submission envelope uuid, -p is the assay process uuid (this must be the final sequencing process uuid), -D for doing a dry run (doesn't export to blue) and -o is the directly where the bundle json files will get written beofre loading into neo4j.

./bundle_to_neo.sh -e 86243a5a-b869-4da3-b97f-759ba26b0e2c -p afd5ad73-353b-4d04-b045-15736f7cf53c -D True -o output

Creating extended graph links from ingest

The version of the script creates the modified graph linking json identified at the Cambridge Biohackathon. The script doesn't download entire bundles, it just creates the linking. It does so for all the data in a given submission.

Run the following command. -e is the submission envelope (Mongo) ID, -n specifies the environment (staging, integration or dev) in which to run the script and -o is the directly where the linked graph json files will get written.

./bundle_to_neo.sh -e 5ba0bd5eb32e850007eb6dd6 -n staging -o output

Query neo4j

Go to http://localhost:7474 and run this query MATCH p=()-->() RETURN p

Name		Name	Last commit message	Last commit date
Latest commit History 32 Commits
.idea		.idea
hca_bundle_neo4j		hca_bundle_neo4j
plugins		plugins
README.md		README.md
README_Advanced_Linking.md		README_Advanced_Linking.md
bundle_neo4j.jpeg		bundle_neo4j.jpeg
bundle_to_neo.sh		bundle_to_neo.sh
graph_analyser.py		graph_analyser.py
graph_analyser.sh		graph_analyser.sh
ingest_bundle_to_neo4j.py		ingest_bundle_to_neo4j.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Human Cell Atlas bundle importer for Neo4j

Install the packages

Loading a bundle from the datastore

Creating a bundle from ingest

Creating extended graph links from ingest

Query neo4j

About

Releases

Packages

Contributors 2

Languages

simonjupp/hca_bundle_to_neo4j

Folders and files

Latest commit

History

Repository files navigation

Human Cell Atlas bundle importer for Neo4j

Install the packages

Loading a bundle from the datastore

Creating a bundle from ingest

Creating extended graph links from ingest

Query neo4j

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages