[Replace the contents this README.MD file with the appropriate "Users manual" needed for the workflow.]
This page gives details concerning guiding principles and formatting required for private analysis workflows and reproducing papers.
-
The
run.py
is the configurableAnadama2 workflow
example file showing the initialization, creation, addition of the workflow tasks of Python and R modules including its PDF report generation. -
The
/input
directory contains the sample input files and Python/R analysis modules. Each of the python analysis should be a standalone command line executable module. In the example file, the module is packaged usingArgumentParser
-
The
/doc
directory contains the documentation usage on the executable modules and the template for the report generation. -
The
/pregenerated-output
directory contains all the example output files generated by the template. Theanadama.log
are included in theoutput
directory after a run. -
The
etc/config.ini
directory contains the default config setting for the template demo run. Feel free to change the settings based on the workflow use case. -
The /src directory is the main source directory for all the
python/R analysis
executables for the workflow.
Follow this installation guide HERE to install Git in your local machine. Additionally, direct to Github sign up page if you do not have the account.
-
anadama2
-
pandas
-
logging
-
numpy
-
matplotlib
-
docopt
-
tidyverse
-
reshape2
-
logging
Getting started with the Analysis Workflow Template:
-
Direct to Analysis-workflow-template github page.
-
Click on "Use this Template" button and fill out the new repository informations (name/description/access).
-
Finally click on "Create repository from template" and you now have a new repository based on the Analysis workflow Template following the standard layouts.
Alternatively,
-
Direct to Github Create New Repository and select the "biobakery/Analysis-workflows-template". Fill out other information (Repository name/description/access)
-
Finally click on "Create repository" and you now have a new repository based on the Private Analysis workflow template.
NOTE: Make your repository "Private" unless it is ready to be released publicly.
- Clone your recently created repository in your local development environment either using:
git clone https://github.com/biobakery/<Name of your repository>
or using the "Clone or Download" button.
python run.py --input input/ --output output --lines 10 --metadata input/metadata.tsv
python run.py --help
usage: run.py [-h] [--version] [--lines LINES]
[--sample-metadata SAMPLE_METADATA] -o OUTPUT [-i INPUT]
[--config CONFIG] [--local-jobs JOBS] [--grid-jobs GRID_JOBS]
[--grid GRID] [--grid-partition GRID_PARTITION]
[--grid-benchmark {on,off}] [--grid-options GRID_OPTIONS]
[--grid-environment GRID_ENVIRONMENT]
[--grid-scratch GRID_SCRATCH] [--dry-run] [--skip-nothing]
[--quit-early] [--until-task UNTIL_TASK]
[--exclude-task EXCLUDE_TASK] [--target TARGET]
[--exclude-target EXCLUDE_TARGET]
[--log-level {DEBUG,INFO,WARNING,ERROR,CRITICAL}]
The etc/config.ini
directory contains the default configuration setting for the template demo run. Feel free to change the settings based on the workflow use case.
The docs/template.py
is a sample Pweave example template (Markdown + Python) for the PDF report generation of the existing tasks/Python/R modules.
https://github.com/biobakery/biobakery/wiki/biobakery_advanced#5-build-biobakery-google-cloud