-
Notifications
You must be signed in to change notification settings - Fork 61
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ENH: Show unaliased Pango lineage in tooltip #984
Comments
@lucyintheskyzzz Pango lineages sometimes go a bit crazy, like B.1.23.456.7.89, so at some point Pango folks invented aliases. In Nextclade the "lineage" column nowadays shows the alias and "unaliased" column shows either partially aliased or canonical lineage, i.e. with aliasing cancelled to some degree or entirely. You can see how aliases are helpful on some of the example sequences: This is not related to Nextclade itself. Aliases is a Pango thing and Nextclade mimics Pango. But yes, our column names are a bit backwards (for legacy reasons) and perhaps our usage of the jargon term "unaliased" is a bit frivolous. |
Hi @ivan-aksamentov do you know where I can find if my sample had del 69-70? |
@lucyintheskyzzz If you mouse-hover a value in the column "Del", a tooltip will show up displaying a list of nucleotide deletions and a list of aminoacid deletions. If Alternatively, you can mouse-hover the black markers in sequence view (last column) and, if detected, deletions will also be displayed there: You can also download the TSV output file: and open it in Excel, Google Sheets or other spreadheet software or analyze it programmatically. The Finally, you can download translated polypeptide sequences (in fasta format) and open them in any alignment viewer software (such as AliView or https://alignmentviewer.org/) and then see if columns 69 and 70 has P.S. I suggest you open a new issue for each question, suggestion or bug report, or join our discussion forum (https://discussion.nextstrain.org), instead of posting into unrelated issues. This will make it easier for developers and other users to navigate in GitHub issues. |
OK thanks! |
Even though I spend about a lot of time working with Pango lineages, I get confused what the aliases correspond to.
It would be amazing if we could show the unaliased Pango lineages in a tooltip.
This should be pretty easy to do:
Where do we get the aliases from? We could store them in the tree, in the .extra properties for Nextclade. Or we put them in virus properties, or an optional input file - or we download directly from Github (https://raw.githubusercontent.com/cov-lineages/pango-designation/master/pango_designation/alias_key.json).
Then we just need to have a bit of JS code to notice when the feature should be switched on (Nextclade Pango column contained in dataset). And it's a simple lookup: take the letters in
alias_key.json
and tag on the dots.This could be very helpful for a lot of people to learn what new aliases correspond to.
I know that @chaoran-chen is working on a similar feature for covSpectrum.
See here for why this would be useful:
The text was updated successfully, but these errors were encountered: