-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Make specific missing node / dangling edge reports for core ingests #507
Comments
Expressed as SQL, it's:
Which produces:
|
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
We could produce missing id lists for any ingest, but it may better focus our attention if we specifically produce a report of the nodes that didn't join in our hpoa_disease_to_phenotype and hpoa_gene_to_disease. Maybe even more specific and actionable would be that we make a report that captures a unique list of diseases (so subject in d2p, object in g2d) that are used in edges but not present in MONDO or the mappings file.
Possible next level would be to split this on MONDO terms that are used but not present (I assume that's a question of timing?) and non-MONDO terms, which means specifically that we lack mapping.
Ideally we should use some generalizable code for this, while still producing a more specific report.
The text was updated successfully, but these errors were encountered: