Tutorial for general UNIX computers with docker

Download Cromwell 34 into your home directory.

$ cd
$ wget https://github.com/broadinstitute/cromwell/releases/download/34/cromwell-34.jar
$ chmod +rx cromwell-34.jar
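The steps in this tutorial assume that wget, git, tar, java and docker are already installed. A minimal sketch of a check for that is shown here; `have_tool` and `check_prereqs` are hypothetical helper names, not part of the pipeline.

```shell
# Hypothetical helpers (not part of the pipeline).

# Return 0 if the named command is on PATH, non-zero otherwise.
have_tool() {
    command -v "$1" >/dev/null 2>&1
}

# Print every missing command to stderr; return non-zero if any is missing.
check_prereqs() {
    missing=0
    for tool in "$@"; do
        if ! have_tool "$tool"; then
            echo "missing: $tool" >&2
            missing=1
        fi
    done
    return "$missing"
}
```

For example, `check_prereqs wget git tar java docker` prints one line per missing tool and returns a non-zero status if anything is absent. It only checks that the commands exist; it does not confirm that the Docker daemon is running.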

Clone this pipeline's repository and move into it.

$ cd
$ git clone https://github.com/ENCODE-DCC/atac-seq-pipeline
$ cd atac-seq-pipeline

Download a SUBSAMPLED (1/400) paired-end sample of ENCSR356KRQ.

$ wget https://storage.googleapis.com/encode-pipeline-test-samples/encode-atac-seq-pipeline/ENCSR356KRQ/ENCSR356KRQ_fastq_subsampled.tar
$ tar xvf ENCSR356KRQ_fastq_subsampled.tar

Download the pre-built genome database for hg38.

$ wget https://storage.googleapis.com/encode-pipeline-genome-data/test_genome_database_hg38_atac.tar
$ tar xvf test_genome_database_hg38_atac.tar
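The names of the directories that these archives unpack into are not stated here, so it can be useful to list an archive's top-level entries before or after extracting it. A small sketch, where `tar_top_level` is a hypothetical helper name:

```shell
# Hypothetical helper: print the unique top-level entries of a tar archive.
tar_top_level() {
    # "tar tf" lists member paths; strip a leading "./", drop empty lines,
    # then keep only the first path component of each member.
    tar tf "$1" | sed -e 's|^\./||' -e '/^$/d' | cut -d/ -f1 | sort -u
}
```

For example, `tar_top_level test_genome_database_hg38_atac.tar` shows which directories or files the archive creates in the current directory.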

Run the pipeline on the test sample.

$ INPUT=examples/local/ENCSR356KRQ_subsampled.json
$ PIPELINE_METADATA=metadata.json
$ java -Dconfig.file=backends/backend.conf -jar ${HOME}/cromwell-34.jar run atac.wdl -i ${INPUT} -o workflow_opts/docker.json -m ${PIPELINE_METADATA}
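The run command refers to several files by path: the Cromwell jar (downloaded to the home directory earlier), `atac.wdl`, the backend configuration, the workflow options and the input JSON. A pre-flight check along these lines can catch a wrong working directory before Java starts; `require_files` is a hypothetical helper name, not part of the pipeline.

```shell
# Hypothetical helper: print each missing path to stderr and
# return non-zero if at least one of the given paths does not exist.
require_files() {
    rc=0
    for path in "$@"; do
        if [ ! -e "$path" ]; then
            echo "not found: $path" >&2
            rc=1
        fi
    done
    return "$rc"
}
```

From inside the `atac-seq-pipeline` directory, it could be called as `require_files "${HOME}/cromwell-34.jar" atac.wdl backends/backend.conf workflow_opts/docker.json "${INPUT}"`.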

The run takes about an hour. All outputs are written to cromwell-executions/atac/[RANDOM_HASH_STRING]/. See the output directory structure documentation for details.
See the full specification for the input JSON file.
You can resume a failed pipeline from where it left off by using the PIPELINE_METADATA (metadata.json) file, which is created for each pipeline run. See the resumer documentation for details. Once the resumer gives you a new input JSON file, set INPUT=resume.[FAILED_WORKFLOW_ID].json instead of INPUT=examples/local/ENCSR356KRQ_subsampled.json and run the pipeline again.
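Because each run gets its own randomly named directory under cromwell-executions/atac/, finding the most recent one can take a moment after several runs. A minimal sketch, assuming the directory names contain no whitespace; `latest_run_dir` is a hypothetical helper name.

```shell
# Hypothetical helper: print the most recently modified run directory.
# The base directory defaults to the location used in this tutorial.
latest_run_dir() {
    base="${1:-cromwell-executions/atac}"
    # -t sorts newest first, -d lists the directories themselves;
    # nothing is printed if no run directory exists yet.
    ls -td "$base"/*/ 2>/dev/null | head -n 1
}
```

Running `latest_run_dir` from the `atac-seq-pipeline` directory prints the path of the newest run, which can then be inspected with `ls`.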
