Releases: MBoemo/DNAscent
4.0.3
- DNAscent is now fully compatible with Dorado and POD5. These are now recommended over Guppy and FAST5.
- DNAscent outputs BrdU and EdU calls in modbam format. This is now recommended over the previous human-readable table of BrdU and EdU probabilities.
- The workflow and visualisation pages of the documentation have been revamped to show how to use these changes.
- Guppy, FAST5, and human-readable detection outputs are all still supported to avoid breaking users’ previous workflows.
- The modbam MM tag for EdU and BrdU specifies N+e? and N+b?, respectively. Users familiar with the SAM optional fields specification will notice that these are actually the one-letter codes for 5fU and 5caU. We have done this as a temporary measure so that BrdU and EdU can be easily shaded and viewed in IGV (View –> Preferences –> Base Mods). We will replace these one-letter codes with the ChEBI IDs for BrdU and EdU as soon as this is supported by IGV.
4.0.2
- The deep learning model is the same as in v4.0.1. Users should see nearly identical performance in BrdU and EdU calling.
- Fork speed and stall calling has been fine-tuned from v4.0.1. Users should now see a closer match between v4.0.2 on R10 pores and that of v3.1.2 on R9 pores.
- Starting with this release, we will be providing supported Singularity images here (https://cloud.sylabs.io/library/mboemo). This is now the recommended way to use and run DNAscent.
- There was a longstanding issue of different basecallers producing slightly different sequencing summary formats which could trip up DNAscent index. Previously, we provided various flags for DNAscent index to account for the different formats but this was more complicated than it needed to be. DNAscent index will now tune itself for whatever sequencing summary format you have.
- v4.0.1 had an issue where one of the model layers was not properly optimised for GPU usage by TensorFlow 2.12.0. This was causing slow runtimes of DNAscent detect on certain GPUs. The issue is fixed in this release, although it required a rollback to TensorFlow 2.4.1 (at least for now). Part of our aim in releasing and supporting Singularity images was to mitigate any inconvenience that might have been caused by the need to change CUDA and CuDNN versions.
- POD5 and Dorado support are coming in a 4.0.3 release which will be shipped in the near future.
4.0.1
This is a pre-release. It is possible (maybe even probable) that the deep learning model and other parts of the software will be re-tuned in the coming weeks as we listen to community feedback.
-
All DNAscent executables now work on Oxford Nanopore R10.4.1 pores. DNAscent detect detects the thymidine analogues BrdU and EdU in single molecules as in v3.0.2 and v3.1.2.
-
POD5 is not yet supported but POD5 support is planned for v4.0.2 which will be the LTS release. POD5 can be converted to FAST5 using the pod5 convert to_fast5 utility (https://pypi.org/project/pod5/#pod5-convert-to_fast5).
-
DNAscent’s deep learning models were trained on R10.4.1 flow cells with 5kHz sampling which is now the default sampling rate on Oxford Nanopore platforms. Using DNAscent with reads sequenced with the older 4kHz sampling rate is not recommended.
-
DNAscent index still uses the sequencing_summary.txt file from Guppy and legacy versions of Guppy are available on the Oxford Nanopore Community webpage. Compatibility with Dorado is planned for v4.0.2 LTS.
-
Tensorflow updated to 2.12.0 and, correspondingly, GPU usage now requires CUDA 11.8 and cuDNN 8.9.
-
Useage is otherwise identical to v3.1.2.
-
Training data for this release was provided by Mathew Jones at the University of Queensland. This software was developed in collaboration with the Jones Lab, Merrick Lab, and McClelland Lab to whom we are grateful for their collaboration and support. We are particularly grateful to them for supporting the release of the software to the community ahead of publication.
3.1.2
- DNAscent forkSense now assigns a stall score to each called fork,
- DNAscent forkSense can assign a replication stress signature to each called fork,
- DNAscent detect no longer outputs the reference 6mer corresponding to each thymidine position in order to reduce output file size,
- improvements to fork calling and segmentation,
- Released with Jones MJK, Rai SK, Pfuderer PL, Bonfim-Melo A, Pagan JK, Clarke PR, McClelland SE, Boemo MA. A high-resolution, nanopore-based artificial intelligence assay for DNA replication stress in human cancer cells. bioRxiv.
3.0.2
- DNAscent detect now detects two different thymidine analogues, BrdU and EdU, in the same molecule,
- DNAscent forkSense now uses the spatial patterning of EdU and BrdU to determine fork direction as in DNA fibre,
- dnascent2bedgraph utility updated to plot both EdU and BrdU tracks in genome browsers,
- DNAscent regions is now deprecated and has been fully superceded by DNAscent forkSense,
- DNAscent psl is now deprecated as reads can be more comprehensively plotted using the dnascent2bedgraph utility,
- Migration from Tensorflow 1.14 to 2.4.1 and, correspondingly, GPU usage now requires CUDA 11 and cuDNN 8,
- Released with Totanes FIG, Gockel J, Chapman SE, Bartfai R, Boemo MA, Merrick CJ. Replication origin mapping in the malaria parasite Plasmodium falciparum. bioRxiv.
2.0.2
- Migration from HMM-based BrdU detection at every thymidine to ResNet-based detection at every thymidine,
- Significant increases to BrdU detection accuracy,
- Support for BrdU detection on GPUs,
- DNAscent forkSense to call replication origins and termination sites in both synchronously and asynchronously replicating cells at any point in S-phase,
- DNAscent align to align nanopore signals to reference,
- Significant increases to replication origin calling accuracy and sensitivity,
- Visualisation utility for plotting output of multiple DNAscent executables as bedgraphs,
- Released with Boemo, MA. DNAscent v2: Detecting Replication Forks in Nanopore Sequencing Data with Deep Learning. bioRxiv 2020. [Preprint Link]