2.13.0
Nextclade Web 2.13.0, Nextclade CLI 2.13.0 (2023-03-21)
Attach sequences to a priori most likely node if reference tree contains "placement_prior"
Until now, when there were multiple positions with equal numbers of mismatches between a query sequence and reference tree position, Nextclade always attached the query sequence to the reference tree node with the fewest number of ancestors. Due to the way recombinants are placed in the SARS-CoV-2 reference trees, this meant that in particular partial sequences were often attached to recombinants. With most recombinants being rare, this bias to attach to recombinants was often surprising.
In this version, we introduce a new feature that allows to attach sequences to a priori most likely nodes - taking into account which positions on the reference tree are most commonly found in circulation. The information on the prior probability that a particular reference tree node is the best match for a random query sequence is contained in the placement_prior
reference tree node attribute. This attribute is currently only present in the most recent SARS-CoV-2 reference trees. The calculation can be found in this nextclade_data_workflows
pull request.
To give an example: a partial sequence may have as many mismatches when compared to BA.5 as it has to the recombinant XP. Based on sequences in public databases, we know that BA.5 is much more common than XP. Hence, the query sequence is attached to BA.5. Previously, the query sequence would have been attached to XP, because XP has fewer parent nodes in the reference tree.
The impact of the feature is biggest for partial and incomplete sequences.
Add custom phenotype values to the newly placed tree nodes
When available in the dataset, the phenotype values (such as ace2_binding
and immune_escape
) are written into all output files except Auspice tree JSON. This omission is now fixed, and these values are set as tree node attributes. This allows to see the values and colorings for phenotype values on the tree page, and when loading the output tree JSON file into Auspice.
Fix length of 3' unsequenced aminoacid ranges in Nextclade Web
Nextclade Web was showing right boundary of the unsequenced AA range on the 3' end of peptide sequences incorrectly - the range was longer than expected. The calculations were using length of a gene in nucleotides, where there should be length in codons. This is now fixed.
Fix incorrect indices in mutation badges
The mutation badges in various places in Nextclade Web could show position "0", even though they are supposed to be 1-based. This was due to a programming mistake, which is now corrected.
Fix input-pcr-primers
and input-virus-properties
URL params in Nextclade Web
The input-pcr-primers
and input-virus-properties
URL params were swapped in the code accidentally, so one was incorrectly setting the other. This is now fixed.
Ensure translation warnings in CLI
Due to an omission, Nextclade CLI and Nextalign CLI since v2 did not print sequence translation-related warnings to the console. This is now fixed.
Fix Google Search Console warnings
We resolved warnings in Google Search Console: added canonical URL meta tag, and added noindex
tag for non-release deployments. This should improve Nextclade appearance in Google Search.
Internal changes:
- freeze wasm-pack version to 0.10.3
Commit history
(click to expand)
Instructions
📥 Nextclade CLI & Nextalign CLI can be downloaded from the links in the "Assets" section just below. Click "Show all" at the bottom of the "Assets" section to show more download options. Note the difference between "nextalign" and "nextclade" files as well as differences in operating systems and computer architectures.
🌐 Nextclade Web is available at https://clades.nextstrain.org
🐋 Docker images are available at DockerHub
📚 To understand how it all works, make sure to read the Documentation