-
Notifications
You must be signed in to change notification settings - Fork 6
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Merge pull request #4 from kircherlab/develop
- Loading branch information
Showing
5 changed files
with
144 additions
and
25 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -15,4 +15,5 @@ analysis/* | |
!.test | ||
!LICENSE | ||
!README.md | ||
!CHANGELOG.md | ||
!*.smk |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,19 @@ | ||
# cfDNA-Workflow Changelog | ||
|
||
## develop | ||
|
||
## v0.1.2 | ||
|
||
### refactor | ||
|
||
- overlays.py | ||
- adding samples is now wrapped in a function | ||
- flanking regions are now determined dynamically | ||
|
||
### documentation | ||
|
||
- added README.md in config dir | ||
|
||
- updated README.md in root dir | ||
- updated input sections for workflows documentation | ||
- added links to .tsv files and README.md in config dir |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,47 @@ | ||
# Documentation for config files | ||
|
||
## config.yml | ||
|
||
The config.yml file configures values that should stay constant between samples. | ||
|
||
```yml | ||
samples: "config/samples.tsv" # .tsv file containing sample names and locations | ||
regions: "config/regions.tsv" # .tsv file containing bed files with regions of interest | ||
|
||
tissue: ["CACO.2", "MCF7", "PC.3"] # proteinAtlas tissues for generating plots in GE workflow | ||
refSample: "NPH001" # reference sample for rank correlation comparison | ||
minRL: 120 # minimum read length for calculating WPS | ||
maxRL: 180 # maximum read length for calculating WPS | ||
bpProtection: 120 # value for WPS window | ||
``` | ||
## samples.tsv | ||
The samples.tsv contains a header with four columns: | ||
```bash | ||
ID sample path ref_samples | ||
experimentID testsample1 "/path/to/testsample1.bam" testsample2,testsample3 | ||
experimentID testsample2 "/path/to/testsample2.bam" testsample1,testsample3 | ||
experimentID testsample3 "/path/to/testsample3.bam" testsample1,testsample2 | ||
``` | ||
|
||
- **ID** - ID for a certain analysis to create identifiable directories and/or filenames | ||
- **sample** - sample name used to identify files | ||
- **path** - path to input file | ||
- **ref_sample** - Reference sample for some visualizations/calculations. ref_samples are comma separated, must be in present in the sample column and every sample needs a ref_sample (e.g. itself). | ||
|
||
## regions.tsv | ||
|
||
The regions.tsv contains a header with two columns: | ||
|
||
```text | ||
target path | ||
gene1 /path/to/gene1.bed | ||
TF1 /pat/to/TF1BS.bed | ||
``` | ||
|
||
- **target** - describes the targets defined in the correspoding .bed file | ||
- **path** - path to input .bed file containing coordinates of interest (all coordinates should be centered around a specific feature and of same length) | ||
|
||
**Note:** .bed has to contain the first 6 fields (chrom, chromStart, chromEnd, name, value, strand), even though name and value are not actively used. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters