You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hello,
We are using Cromwell on a Google Cloud Platform to analyze sequencing data. We found that, at least for bwa and WGS, scattering the reads in small chunks across several nodes with few cpus is cheaper than running the entire sample on one node with many cpus, when using preemptible instances.
From the documentation I saw that for gemBS the recommendation is to run all steps on one machine with multiple cpus for parallelization. My question is whether it is possible to perform the map step of gemBS across many independent instances, and then merge the resulting bams using gemBS.
Thank you,
Gil
The text was updated successfully, but these errors were encountered:
In principle yes (we don't use cromwell, but we have a workflow based on
Slurm which splits the run across a cluster without any major problems).
If there is a shared file system for all nodes then it should work pretty
much without modification, otherwise you'll have to copy the bams for each
chunk into the same directory and use the --no-db option to gemBS for the
merge step.
Simon
On Wed, Jul 7, 2021 at 2:31 PM gilhornung ***@***.***> wrote:
Hello,
We are using Cromwell on a Google Cloud Platform to analyze sequencing
data. We found that, at least for bwa and WGS, scattering the reads in
small chunks across several nodes with few cpus is cheaper than running the
entire sample on one node with many cpus, when using preemptible instances
<https://cloud.google.com/compute/docs/instances/preemptible>.
From the documentation I saw that for gemBS the recommendation is to run
all steps on one machine with multiple cpus for parallelization. My
question is whether it is possible to perform the map step of gemBS
across many independent instances, and then merge the resulting bams using
gemBS.
Thank you,
Gil
—
You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub
<#84>, or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AAY465Z7NQFKJU4QVSD3JM3TWRCKNANCNFSM476SG54Q>
.
Hello,
We are using Cromwell on a Google Cloud Platform to analyze sequencing data. We found that, at least for bwa and WGS, scattering the reads in small chunks across several nodes with few cpus is cheaper than running the entire sample on one node with many cpus, when using preemptible instances.
From the documentation I saw that for gemBS the recommendation is to run all steps on one machine with multiple cpus for parallelization. My question is whether it is possible to perform the
map
step of gemBS across many independent instances, and then merge the resulting bams using gemBS.Thank you,
Gil
The text was updated successfully, but these errors were encountered: