Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

update mappings, semantic types (without update to biolink v2) #61

Closed
wants to merge 6 commits into from

Conversation

colleenXu
Copy link
Contributor

@colleenXu colleenXu commented Jul 7, 2021

See the comments in the conversation of most commits for more details.

these fixes should help BTE handle most semantic types in the APIs it ingests

the updated mappings should also help BTE handle the IDs from the APIs/operations it uses

add semantic types that we already have in our kps (some are also already in the config file)

clinical finding is a semantic type in the multiomics wellness kp api that we could support
@coveralls
Copy link

coveralls commented Jul 7, 2021

Pull Request Test Coverage Report for Build 1044093884

  • 0 of 0 changed or added relevant lines in 0 files are covered.
  • No unchanged relevant lines lost coverage.
  • Overall coverage remained the same at 96.831%

Totals Coverage Status
Change from base Build 972320854: 0.0%
Covered Lines: 439
Relevant Lines: 444

💛 - Coveralls

@andrewsu
Copy link
Member

@colleenXu this PR feels very big to me -- seems to combine several different objectives here. We can discuss on Tuesday whether we want to break this PR into multiple smaller ones, but in the future it would be better to do that anyway I think.

large changes:

-  added Transcript, ClinicalFinding (for multiomics wellness kp api)

Smaller changes:

-  fix typos
-  added mapping fields
-  SmallMolecule: removed KEGG (not chemical ids), added LINCS / CAS / HMDB (for specific apis that use them)
-  Drug: added rxcui, ndc, and the chemical stuff
-  MolecularActivity: changed KEGG to KEGG.REACTION
-  BiologicalProcess: removed KEGG
-  CellularComponent: removed MetaCyc, Rhea
-  Pathway: add GO/SMPDB, change pharmgkb to pharmgkb.pathways
-  AnatomicalEntity: removed NCIT (not in api)
-  Cell: removed umls, hpo (couldn't find in the api)
changed clinvar ID to use variation ID (not rcv, which is the record of the variation + its clinical significance). 

also added another mapping field for hgvs
to match the changes to the src/config and src/common/types files. 

Also update to link to biothings rather than kevinxin90's repo
update RHEA tests to work based on molecular activity semantic type
change pubchem -> pubchem.compound

make CHEMBL7512_DB_IDS writing match the format of the other constants
@colleenXu colleenXu closed this Jul 19, 2021
@colleenXu colleenXu deleted the updateMappings branch July 28, 2021 18:56
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants