🛠️ Modification to bin cobalt outputs and add purity forcing #3

ddomenico · 2024-08-06T20:41:08Z

Several modifications in order to improve workflow for routine use.

Added binning in order to decrease oversegmentation by grouping together combinations of probes with similar logR values.
Added option to specify min/max purity and regenerate purple solution.
Added check to not rerun amber/cobalt if results previously generated.

juanesarango

Awesome man! This will clean the profiles a lot. Great job.

juanesarango · 2024-08-07T17:53:33Z

main.nf

- -ref_genome_version ${params.genomeVersion}
+ if [ -f "${params.outdir}/amber/${tumor}.amber.baf.tsv.gz" ] && \
+ [ -f "${params.outdir}/amber/${tumor}.amber.baf.pcf" ] && \
+ [ -f "${params.outdir}/amber/${tumor}.amber.qc" ]; then


These checks are managed by nextflow. If nextflow sees the outputs you expect in a previous run, it will cached these step.

Was this not happening for you?

This is part of the updates for the force -- because the workflow has finished completely the runDir is already copied to the publishDir and it goes to redo these steps since the output files don't exist in the new runDir.

juanesarango · 2024-08-07T17:57:00Z

main.nf

+ last_idx = cobalt_ratio_pcf_probes_logR.index[-1]
+
+ cobalt_ratio_pcf_probes_logR.to_csv("${tumor}.cobalt.ratio.pcf", sep='\\t', index=False)
+ """


Add here some """.stripIndent() to the script so the script is properly indented when added to a file.

See the other processes.

Thanks I missed this -- surprised this didn't cause an error.

juanesarango · 2024-08-07T18:17:53Z

main.nf

@@ -1,4 +1,5 @@
-params.cores = 4
+params.cores = 1 
+params.memory = '4 GB'


Im thinking about keeping 32 GB, and use 4 GB for testing in nf.test:

params { memory = '4 GB' tumor = "TEST" binProbes = 100 binLogR = 0.5 cobalt_ratio_pcf = "${projectDir}/tests/outdir/cobalt/TEST.cobalt.ratio.pcf" cobalt_ratio_tsv = "${projectDir}/tests/outdir/cobalt/TEST.cobalt.ratio.tsv.gz" }

Unless you think it doesn't need that much memory by default?

I ended up adding a memory parameter because it seems sample/system dependent. I prefer to keep it low and then configure it higher on samples or systems where necessary but either way works.

ddomenico added 10 commits July 24, 2024 17:08

✨ add binning logic

a56dd1a

🏷️ change names and add new params to logs

04c403a

🛠️ multiple changes to support binning and purity force

e033572

✅ fix tests

6270a40

🛠️ add memory parameter

b763c14

✅ reduce default memory to fix tests

b4e9645

🐳 update docker and add test for binning

cb86daf

✅ fix full workflow test

8bb86cb

✅ fix ci

80fa8ae

✅ add bin test to ci

f64a0ae

ddomenico requested a review from juanesarango August 7, 2024 15:30

juanesarango approved these changes Aug 7, 2024

View reviewed changes

ddomenico added 2 commits August 7, 2024 14:36

🛠️ add indentation and fix snapshot

f4cafa5

📝 update README and snapshots

4d5f1ef

ddomenico merged commit 1f9517c into main Aug 13, 2024
1 check passed

ddomenico deleted the binning-mod branch August 13, 2024 02:01

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

🛠️ Modification to bin cobalt outputs and add purity forcing #3

🛠️ Modification to bin cobalt outputs and add purity forcing #3

ddomenico commented Aug 6, 2024

juanesarango left a comment

juanesarango Aug 7, 2024

ddomenico Aug 7, 2024

juanesarango Aug 7, 2024

ddomenico Aug 7, 2024

juanesarango Aug 7, 2024 •

edited

Loading

ddomenico Aug 7, 2024

🛠️ Modification to bin cobalt outputs and add purity forcing #3

🛠️ Modification to bin cobalt outputs and add purity forcing #3

Conversation

ddomenico commented Aug 6, 2024

juanesarango left a comment

Choose a reason for hiding this comment

juanesarango Aug 7, 2024

Choose a reason for hiding this comment

ddomenico Aug 7, 2024

Choose a reason for hiding this comment

juanesarango Aug 7, 2024

Choose a reason for hiding this comment

ddomenico Aug 7, 2024

Choose a reason for hiding this comment

juanesarango Aug 7, 2024 • edited Loading

Choose a reason for hiding this comment

ddomenico Aug 7, 2024

Choose a reason for hiding this comment

juanesarango Aug 7, 2024 •

edited

Loading