Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Make specific missing node / dangling edge reports for core ingests #507

Open
kevinschaper opened this issue Aug 18, 2023 · 1 comment
Open
Assignees

Comments

@kevinschaper
Copy link
Member

We could produce missing id lists for any ingest, but it may better focus our attention if we specifically produce a report of the nodes that didn't join in our hpoa_disease_to_phenotype and hpoa_gene_to_disease. Maybe even more specific and actionable would be that we make a report that captures a unique list of diseases (so subject in d2p, object in g2d) that are used in edges but not present in MONDO or the mappings file.

Possible next level would be to split this on MONDO terms that are used but not present (I assume that's a question of timing?) and non-MONDO terms, which means specifically that we lack mapping.

Ideally we should use some generalizable code for this, while still producing a more specific report.

@kevinschaper kevinschaper self-assigned this Aug 18, 2023
@kevinschaper
Copy link
Member Author

Expressed as SQL, it's:

select distinct * from (
  select subject from dangling_edges 
  where provided_by = 'hpoa_disease_to_phenotype_edges' 
      and subject not in (select id from nodes) 
  union 
  select object from dangling_edges 
  where provided_by = 'hpoa_gene_to_disease_edges' 
      and object  not in (select id from nodes)
)

Which produces:

subject
OMIM:138900
OMIM:141749
OMIM:179620
OMIM:276400
OMIM:300910
OMIM:300932
OMIM:301106
OMIM:301107
OMIM:601550
OMIM:601551
OMIM:601800
OMIM:607276
OMIM:607447
OMIM:607457
OMIM:607486
OMIM:607514
OMIM:609027
OMIM:610762
OMIM:611664
OMIM:611724
OMIM:611742
OMIM:611862
OMIM:612042
OMIM:612052
OMIM:612267
OMIM:612362
OMIM:612460
OMIM:612542
OMIM:612556
OMIM:612560
OMIM:612630
OMIM:612797
OMIM:613098
OMIM:613463
OMIM:613508
OMIM:613566
OMIM:613589
OMIM:613793
OMIM:614193
OMIM:614374
OMIM:614419
OMIM:614490
OMIM:614689
OMIM:614745
OMIM:614752
OMIM:615018
OMIM:615021
OMIM:615221
OMIM:615311
OMIM:615602
OMIM:615881
OMIM:616060
OMIM:616093
OMIM:617956
OMIM:617966
OMIM:617970
OMIM:617995
OMIM:618018
OMIM:618057
OMIM:618079
OMIM:618406
OMIM:618807
OMIM:619812
OMIM:620116
OMIM:620150
OMIM:620207
OMIM:620296
OMIM:620300
OMIM:620303
OMIM:620305
OMIM:620306
OMIM:620310
OMIM:620311
OMIM:620312
OMIM:620313
OMIM:620314
OMIM:620315
OMIM:620316
OMIM:620317
OMIM:620319
OMIM:620320
OMIM:620321
OMIM:620322
OMIM:620323
OMIM:620326
OMIM:620327
OMIM:620330
OMIM:620331
OMIM:620332
OMIM:620333
OMIM:620342
OMIM:620343
OMIM:620344
OMIM:620351
OMIM:620352
OMIM:620353
OMIM:620354
OMIM:620356
OMIM:620357
OMIM:620358
OMIM:620359
OMIM:620364
OMIM:620365
OMIM:620366
OMIM:620367
OMIM:620368
OMIM:620369
OMIM:620370
OMIM:620371
OMIM:620372
OMIM:620374
OMIM:620375
OMIM:620376
OMIM:620379
OMIM:620383
OMIM:620384
OMIM:620386
OMIM:620389
OMIM:620393
OMIM:620398
OMIM:620400
OMIM:620409
OMIM:620410
OMIM:620411
OMIM:620415
Orphanet:106
Orphanet:1162
Orphanet:139498
Orphanet:1480
Orphanet:155
Orphanet:1648
Orphanet:206
Orphanet:209893
Orphanet:217031
Orphanet:2227
Orphanet:2335
Orphanet:238616
Orphanet:243377
Orphanet:243761
Orphanet:244275
Orphanet:276271
Orphanet:279
Orphanet:280110
Orphanet:284130
Orphanet:319681
Orphanet:319691
Orphanet:319698
Orphanet:319705
Orphanet:323
Orphanet:33271
Orphanet:34145
Orphanet:3435
Orphanet:35056
Orphanet:353225
Orphanet:357
Orphanet:362
Orphanet:387
Orphanet:40050
Orphanet:406
Orphanet:411533
Orphanet:411969
Orphanet:413
Orphanet:426
Orphanet:449262
Orphanet:458713
Orphanet:462
Orphanet:464293
Orphanet:521399
Orphanet:529819
Orphanet:555
Orphanet:564178
Orphanet:619
Orphanet:64738
Orphanet:64740
Orphanet:651
Orphanet:69127
Orphanet:751
Orphanet:771
Orphanet:802
Orphanet:825
Orphanet:85142
Orphanet:856
Orphanet:862
Orphanet:907
Orphanet:93385
Orphanet:95698
Orphanet:97562

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant