diff --git a/docs/infrastructure.md b/docs/infrastructure.md
index cd29300c..22c4a88d 100644
--- a/docs/infrastructure.md
+++ b/docs/infrastructure.md
@@ -554,7 +554,9 @@ You can choose to omit the flag or explicitly specify `--no-clear-data` (default
 Including these relationships in a graph is important to be able to answer questions such as how many different diagnoses are represented in a graph database, to query for higher-order concepts for a given variable, and more.
 
 The participant variables modeled by Neurobagel are named using Neurobagel's own vocabulary (for more information, see this page on [controlled terms](./term_naming_standards.md)).
-This vocabulary, which defines internal relationships between vocabulary terms, is serialized in the file [`nb_vocab.ttl`](https://github.com/neurobagel/api/blob/main/vocab/nb_vocab.ttl) available from the `neurobagel/api` repository.
+This vocabulary, which defines internal relationships between vocabulary terms,
+is serialized in the file [`nb_vocab.ttl`](https://github.com/neurobagel/api/blob/main/vocab/nb_vocab.ttl) available from the `neurobagel/api` repository.
+If you have cloned the api repository, you will already have downloaded the vocabulary file.
 
 **The `nb_vocab.ttl` file should be added to every created Neurobagel graph database.**
 This can be done using the same script we used to upload the dataset JSONLD files, [`add_data_to_graph.sh`](https://github.com/neurobagel/api/blob/main/add_data_to_graph.sh), which adds all `.ttl` and/or `.jsonld` files in a given directory to the specified graph.