All test samples and genome data are shared on our public DNAnexus project. You do not need to download any data to test our pipeline on the DNAnexus platform.
There are two methods to run our pipeline on DNAnexus.
- Building your own DX workflow from atac.wdl with dxWDL (CLI)
- Using a pre-built DX workflow on our public DX project (Web UI)
This document describes the first method (building your own DX workflow with dxWDL).
- Sign up for a DNAnexus account.
- Create a new DX project named [YOUR_PROJECT_NAME] by clicking "+New Project" at the top left.
- Download dxWDL.
  $ cd
  $ wget https://github.com/DNAnexus/dxWDL/releases/download/0.77/dxWDL-0.77.jar
  $ chmod +rx dxWDL-0.77.jar
- Git clone this pipeline.
  $ cd
  $ git clone https://github.com/ENCODE-DCC/atac-seq-pipeline
- Move to the pipeline's directory.
  $ cd atac-seq-pipeline
- Choose an appropriate input for your project (AWS or Azure):
  - AWS
    $ INPUT=examples/dx/ENCSR356KRQ_subsampled_dx.json
  - Azure
    $ INPUT=examples/dx_azure/ENCSR356KRQ_subsampled_dx_azure.json
- Compile atac.wdl with an input JSON for the SUBSAMPLED (1/400) paired-end sample of ENCSR356KRQ.
  $ PROJECT=[YOUR_PROJECT_NAME]
  $ OUT_FOLDER=/test_sample_atac_ENCSR356KRQ_subsampled
  $ java -jar dxWDL-0.77.jar compile atac.wdl -project ${PROJECT} -f -folder ${OUT_FOLDER} -defaults ${INPUT} -extras workflow_opts/docker.json
- Go to the DNAnexus project page and click on your project.
- Move to the directory /test_sample_atac_ENCSR356KRQ_subsampled.
- You will find a DX workflow atac with all parameters pre-defined. Click on it.
- Specify an output directory by clicking "Workflow Actions" at the top right. Click on "Set output folder" and choose an output folder.
- Click on "Run as Analysis..." and you will be automatically redirected to the "Monitor" tab.
- The run will take about an hour. You will find all outputs in your output folder, including the final QC report (qc.html) and its JSON version (qc.json).
- See the full specification for the input JSON file.
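
The command-line steps above (choosing an input JSON and compiling atac.wdl) can be sketched as a small dry-run script. This is only an illustration under stated assumptions: the helper names select_input and build_compile_cmd are hypothetical and are not part of the pipeline or of dxWDL, and the script only prints the compile command from this document rather than running it.

```shell
#!/bin/sh
# Dry-run sketch: prints the dxWDL compile command instead of executing it.
set -eu

# dxWDL release used in this document.
DXWDL_JAR="dxWDL-0.77.jar"

# Hypothetical helper: map a platform name (aws or azure) to the example
# input JSON path listed in the steps above.
select_input() {
  case "$1" in
    aws)   echo "examples/dx/ENCSR356KRQ_subsampled_dx.json" ;;
    azure) echo "examples/dx_azure/ENCSR356KRQ_subsampled_dx_azure.json" ;;
    *)     echo "unknown platform: $1 (expected aws or azure)" >&2; return 1 ;;
  esac
}

# Hypothetical helper: assemble the compile command as a single string.
# Arguments: $1 = DX project name, $2 = output folder, $3 = input JSON path.
build_compile_cmd() {
  echo "java -jar ${DXWDL_JAR} compile atac.wdl -project $1 -f -folder $2 -defaults $3 -extras workflow_opts/docker.json"
}

# Example: print the command for an AWS project.
INPUT=$(select_input aws)
build_compile_cmd "[YOUR_PROJECT_NAME]" /test_sample_atac_ENCSR356KRQ_subsampled "$INPUT"
```

Replace [YOUR_PROJECT_NAME] with your own project name, review the printed command, and then run it from the pipeline's directory. Printing first makes it easy to confirm that the input JSON matches your project's cloud platform before anything is compiled.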