Skip to content

Commit

Permalink
Merge pull request #408 from d3b-center/update-tmb-calc
Browse files Browse the repository at this point in the history
Clarify SNV callers included in intersect bed for WGS samples
  • Loading branch information
jharenza authored Jun 23, 2023
2 parents 8fe8c5a + a1b54bc commit 74f7376
Show file tree
Hide file tree
Showing 2 changed files with 5 additions and 3 deletions.
6 changes: 3 additions & 3 deletions analyses/tmb-calculation/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -17,10 +17,10 @@ These genomic regions are used to first filter mutations to these regions and th
### All mutations TMB

For all mutation TMBs, consensus calls are used.
For WGS samples, the size of the genome covered by the intersection of Strelka2 and Mutect2's surveyed areas which are considered representative of all callers is used for the denominator.
For WGS samples, the size of the genome covered by the intersection of Strelka2, Mutect2, and Vardict's surveyed areas, which are considered representative of all callers, is used for the denominator.

```
WGS_all_mutations_TMB = (total # mutations in consensus MAF) / intersection_strelka_mutect_genome_size
WGS_all_mutations_TMB = (total # mutations in consensus MAF) / intersection_strelka_mutect_vardict_genome_size
```
For WXS samples, the size of the genome for each the WXS bed region file is used for the denominator with the associated tumor samples.
```
Expand All @@ -33,7 +33,7 @@ This file is included in the OpenPedCan data download.
SNVs outside of these coding sequences are filtered out before being summed and used for TMB calculations as follows:

```
WGS_coding_only_TMB = (total # coding mutations in consensus MAF) / intersection_wgs_strelka_mutect_CDS_genome_size
WGS_coding_only_TMB = (total # coding mutations in consensus MAF) / intersection_wgs_strelka_mutect_vardict_CDS_genome_size
```
For WXS samples, each the WXS bed region file is intersected with the coding sequences for filtering and for determining the denominator to be used with the with the associated tumor samples.
```
Expand Down
2 changes: 2 additions & 0 deletions analyses/tmb-calculation/run_tmb_calculation.sh
Original file line number Diff line number Diff line change
Expand Up @@ -32,6 +32,8 @@ histology_file="${data_dir}/histologies-base.tsv"

############# Create intersection BED files for TMB calculations ###############
# Make All mutations BED files
# NOTE: `hg38_strelka.bed` is equivalent to `WGS.hg38.strelka2.unpadded.bed`
# and `wgs_canonical_calling_regions.hg38.bed` is equivalent to `WGS.hg38.mutect2.vardict.unpadded.bed`
bedtools intersect \
-a ${data_dir}/hg38_strelka.bed \
-b ${data_dir}/wgs_canonical_calling_regions.hg38.bed \
Expand Down

0 comments on commit 74f7376

Please sign in to comment.