Feature Request - add seqs_per_slice=1000 when convert to cram #91

lindaxiang · 2020-03-17T19:31:20Z

Detailed Description

As indicated in wiki: https://wiki.oicr.on.ca/display/icgcargotech/Data+Management+Tasks
The way the CRAM is generated can minimise compute overhead, see samtools, use seqs_per_slice=1000 instead of the default. Increase in file size is negligible, but has a large impact on random access.

Possible Implementation

Should add seqs_per_slice=1000 at
https://github.com/icgc-argo/dna-seq-processing-tools/blob/master/tools/bam-merge-sort-markdup/bam-merge-sort-markdup.py#L62

lindaxiang added the new-feature Request is a new feature label Mar 17, 2020

lindaxiang assigned rosibaj Mar 17, 2020

junjun-zhang mentioned this issue Apr 2, 2020

Bam merge sort markdup.0.1.9.0 #92

Closed

junjun-zhang unassigned rosibaj Jun 23, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature Request - add seqs_per_slice=1000 when convert to cram #91

Feature Request - add seqs_per_slice=1000 when convert to cram #91

lindaxiang commented Mar 17, 2020

Feature Request - add seqs_per_slice=1000 when convert to cram #91

Feature Request - add seqs_per_slice=1000 when convert to cram #91

Comments

lindaxiang commented Mar 17, 2020

Detailed Description

Possible Implementation