Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Parsing mouse ensembl gtf into gene-level data gives duplicate symbols #8

Open
lianos opened this issue Jun 15, 2018 · 0 comments
Open

Comments

@lianos
Copy link
Contributor

lianos commented Jun 15, 2018

My parsing of the mouse ensembl gtf into gene-level data gives feature file assigns "Olfr912" and "Srp54a" to more than one ensembl identifier.

For instance, Olfr912 gets assigned to ENSMUSG00000111448 (correctly) but also ENSMUSG00000060114 (incorrectly). The latter should be Olfr910. The archs4 "gene_name" gets this right, so this is now being used for the "symbol" column in commit a34e466

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant