Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Check species mappings genes/proteins #9

Open
2 of 3 tasks
DeniseSl22 opened this issue Oct 30, 2019 · 0 comments
Open
2 of 3 tasks

Check species mappings genes/proteins #9

DeniseSl22 opened this issue Oct 30, 2019 · 0 comments
Assignees

Comments

@DeniseSl22
Copy link
Contributor

DeniseSl22 commented Oct 30, 2019

See WP2533_r107133, which has for three genes a Uniprot ID pointing towards an incorrect species (2x Arabidopsis thaliana and 1x Saccharomyces cerevisiae) compared to the species for the PW (Homo sapiens).

  • Glycerol-3-phosphate 2-O-acyltransferase EC 2.3.1.198
  • Phosphatidylinositol-4,5-bisphosphate 3-kinase EC 2.7.1.153
  • Acylglycerone-phosphate reductase EC 1.1.1.101

In the RDF, mappings to Ensembl exist, which didn't start with "ENSG" (the code for Human IDs).
These mappings are not in the website linkouts, this uses the API and filters on species, where the RDF uses all mapping files simultaneously.
So we could use this additional mapping to our advantage, to track species which start with the wrong Ensembl code for the species of the PW.
I'll fix the three IDs above, but it would be good to make a Unit test out of this I believe.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants