diff --git a/analyses/tmb-calculation/README.md b/analyses/tmb-calculation/README.md index b6545bf0e6..7fcc46109d 100644 --- a/analyses/tmb-calculation/README.md +++ b/analyses/tmb-calculation/README.md @@ -17,10 +17,10 @@ These genomic regions are used to first filter mutations to these regions and th ### All mutations TMB For all mutation TMBs, consensus calls are used. -For WGS samples, the size of the genome covered by the intersection of Strelka2 and Mutect2's surveyed areas which are considered representative of all callers is used for the denominator. +For WGS samples, the size of the genome covered by the intersection of Strelka2, Mutect2, and Vardict's surveyed areas, which are considered representative of all callers, is used for the denominator. ``` -WGS_all_mutations_TMB = (total # mutations in consensus MAF) / intersection_strelka_mutect_genome_size +WGS_all_mutations_TMB = (total # mutations in consensus MAF) / intersection_strelka_mutect_vardict_genome_size ``` For WXS samples, the size of the genome for each the WXS bed region file is used for the denominator with the associated tumor samples. ``` @@ -33,7 +33,7 @@ This file is included in the OpenPedCan data download. SNVs outside of these coding sequences are filtered out before being summed and used for TMB calculations as follows: ``` -WGS_coding_only_TMB = (total # coding mutations in consensus MAF) / intersection_wgs_strelka_mutect_CDS_genome_size +WGS_coding_only_TMB = (total # coding mutations in consensus MAF) / intersection_wgs_strelka_mutect_vardict_CDS_genome_size ``` For WXS samples, each the WXS bed region file is intersected with the coding sequences for filtering and for determining the denominator to be used with the with the associated tumor samples. ``` diff --git a/analyses/tmb-calculation/run_tmb_calculation.sh b/analyses/tmb-calculation/run_tmb_calculation.sh index 2999559186..c6c092eb47 100755 --- a/analyses/tmb-calculation/run_tmb_calculation.sh +++ b/analyses/tmb-calculation/run_tmb_calculation.sh @@ -32,6 +32,8 @@ histology_file="${data_dir}/histologies-base.tsv" ############# Create intersection BED files for TMB calculations ############### # Make All mutations BED files +# NOTE: `hg38_strelka.bed` is equivalent to `WGS.hg38.strelka2.unpadded.bed` +# and `wgs_canonical_calling_regions.hg38.bed` is equivalent to `WGS.hg38.mutect2.vardict.unpadded.bed` bedtools intersect \ -a ${data_dir}/hg38_strelka.bed \ -b ${data_dir}/wgs_canonical_calling_regions.hg38.bed \