-
Notifications
You must be signed in to change notification settings - Fork 37
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add Kraken2 #36
Comments
It was to look for contamination in the reads in a quick and unbiased way across organisms. |
I'm not really sure this in the scope of a demultiplexing workflow. Sounds to me like that's more like extensive input QC to be done before starting a wf. |
I guess the thought process behind this was that the raw reads come out of the demultiplexing workflow so it would be good to add this as an option here as "pre-pipeline" QC (like FastQC) as this sort of information could also be useful to Sequencing facilities that may be interested in the results of the demultiplexing. I suspect it's more effort to add Kraken2 to all individual pipelines but also not fussed if this isn't added here. The biggest complication is sourcing a Kraken2 databases that is comprehensive enough to contain "most" species for contamination detection. |
Be awesome to have contamination detection built-in here though. |
No problem, we can keep te issue around for the future |
Also, does kraken support remote db's or does it have to be local? |
Hey @matthdsm - I have a subworkflow already fleshed out that does allow even to subsample fastqs using seqtk (unbiased) to speed up the process and then runs kraken2 on it, creating a report that can be fed into MultiQC directly. We use this internally already, will contribute now to subworkflows in modules and then we can simply take it from there 👍🏻 |
See a PR to add subworkflow for this - we use this already in our own, but I'm attempting to contribute and recycle subworkflows from nf-core to make things easier 👍🏻 |
Hi! We actually have a use case for this at our platform and would like to include @apeltzer 's subworkflow into nf-core/demultiplex. Is that ok if we start working on this during the Hackathon? |
This is now implemented via #220 - will be part of 1.5.0 release |
Not sure what was going on with this one before, maybe @csawye01 or @drpatelh can shed some light here.
The text was updated successfully, but these errors were encountered: