
Commit

Update schema.py
EthanSteinberg authored Jul 30, 2024
1 parent 3f7c441 commit 21a48d0
Showing 1 changed file with 9 additions and 6 deletions.
15 changes: 9 additions & 6 deletions src/meds/schema.py
@@ -18,12 +18,15 @@
# - data/
# A (possibly nested) folder containing multiple parquet files containing patient event data following the events_schema folder.
# glob("data/**/*.parquet") is the recommended way for obtaining all patient event files.
-# - dataset_metadata.json
-# Dataset level metadata containing information about the ETL used, data version, etc
-# - (Optional) code_metadata.parquet
-# Code level metadata containing information about the code descriptions, standard mappings, etc
-# - (Optional) patient_split.csv
-# A specification of patient splits that should be used.
+# - metadata/
+# A (possibly nested) folder containing metadata for the dataset. Any files within this folder are optional, but we also specify the format of 3 particularly
+# important metadata.
+# * dataset_metadata.json
+# Dataset level metadata containing information about the ETL used, data version, etc
+# * (Optional) code_metadata.parquet
+# Code level metadata containing information about the code descriptions, standard mappings, etc
+# * (Optional) patient_split.csv
+# A specification of patient splits that should be used.

############################################################
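
For readers adapting to the new layout, here is a minimal sketch of how a consumer might locate files after this change. It is not part of the commit: the helper name `list_meds_files` and its return shape are hypothetical, and it assumes the three named metadata files sit directly under `metadata/`. It relies only on the recommended `glob("data/**/*.parquet")` pattern and the file names listed in the comment above.

```python
import glob
import os

# File names taken from the schema.py comment; every one of them may be absent.
KNOWN_METADATA_FILES = (
    "dataset_metadata.json",
    "code_metadata.parquet",
    "patient_split.csv",
)


def list_meds_files(root):
    """Hypothetical helper: find event files and known metadata files under root.

    Returns (event_files, metadata), where event_files is a sorted list of
    paths matching data/**/*.parquet and metadata maps each known metadata
    file name that exists under metadata/ to its path.
    """
    # recursive=True makes "**" match zero or more nested directories,
    # so files directly in data/ and in nested folders are both found.
    pattern = os.path.join(root, "data", "**", "*.parquet")
    event_files = sorted(glob.glob(pattern, recursive=True))

    # Assumption: the three named files live at the top level of metadata/.
    metadata_dir = os.path.join(root, "metadata")
    metadata = {}
    for name in KNOWN_METADATA_FILES:
        path = os.path.join(metadata_dir, name)
        if os.path.isfile(path):
            metadata[name] = path
    return event_files, metadata
```

Calling `list_meds_files("path/to/dataset")` returns the event file paths plus a dictionary holding only the metadata files that are present, so callers can check for a key such as `"patient_split.csv"` before trying to read it.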
