diff --git a/.DS_Store b/.DS_Store index 8c4dc09..15577cc 100644 Binary files a/.DS_Store and b/.DS_Store differ diff --git a/README.html b/README.html new file mode 100644 index 0000000..293bab6 --- /dev/null +++ b/README.html @@ -0,0 +1,481 @@ + + + + +
+ + + + + + + + +This repository was created to maintain version control of this +pharmacogenomics project at Karolinska Institutet (KI) under the +supervision of Volker M. Lauschke and Yitian Zhou.
+Human UDP-glycosyltransferases (UGTs) are responsible for the +glycosylation of a wide variety of endogenous substrates and commonly +prescribed drugs. Different genetic polymorphisms in UGT genes are +implicated in interindividual differences in drug response and cancer +risk. However, the genetic complexity beyond these variants has not been +comprehensively assessed. We here leveraged whole-exome and whole-genome +sequencing data from 141,456 unrelated individuals across 7 major human +populations to provide a comprehensive profile of genetic variability +across the human UGT gene family. Overall, 9666 exonic variants were +observed of which 98.9% were rare. To interpret the functional impact of +UGT missense variants, we developed a gene family-specific variant +effect predictor. This algorithm identified a total of 1208 deleterious +variants, most of which were found in African and South Asian +populations. Structural analysis corroborated the predicted effects for +multiple variations in substrate binding sites. Combined, our analyses +provide a systematic overview of UGT variability, which can yield +insights into interindividual differences in phase 2 metabolism and +facilitate the translation of sequencing data into personalized +predictions of UGT substrate disposition.
++ +
+Global landscape of human UGT variability. A: +Schematic representation of the canonical transcripts of the UGT1A and +UGT2A loci, including gene-specific and shared exons. +B: The total number of variants across the four human +UGT gene families is shown. C: The distribution of +variant types is shown for all exonic variants. D: Of +all exonic UGT variants, 98.9% and 97.4% were rare or very rare with +global MAF <1% and <0.1%, respectively. Furthermore, 59.8% were +singletons found only in one individual. E: Observed +against expected missense-to-synonymous variant ratio indicates the +mutational constraints of all human UGT genes. Abbreviations: MAF: minor +allele frequency.
++ +
+Ethnogeographic variability within the human UGT gene family. +A: Frequency distribution of the UGT1A1 promoter polymorphism +across seven human populations. B: MAFs of putatively +deleterious variants across all UGT genes. Only the variants that map to +gene-specific exons are shown for each gene. Variants localized in +shared exons of the UGT1A or UGT2A loci are shown as UGT1A[1−10] and +UGT2A[1−2], respectively. C: The average number of +deleterious UGT variants per individual is shown for different +populations. The eight variants responsible for the largest +contributions to overall functional variability are shown +individually.
+No raw data were generated in this project. All used data can be +directly downloaded from the corresponding cited sources (gnomAD, +Ensembl, ClinVar, AlphaMissense, and COSMIC).
+All code run in R/Bash required to perform the analyses and produce +the figures and results can be found in the code +section where they are carefully described.
+++González-Padilla, D., Camara, M. D., Lauschke, V. M., & Zhou, Y. +(2024). Population-scale variability of the human +UDP-glycosyltransferase gene family. Journal of Genetics and Genomics. +doi: https://doi.org/10.1016/j.jgg.2024.06.018
+
+ +
+ +**Global landscape of human UGT variability. A**: Schematic representation of the canonical transcripts of the UGT1A and UGT2A loci, including gene-specific and shared exons. **B**: The total number of variants across the four human UGT gene families is shown. **C**: The distribution of variant types is shown for all exonic variants. **D**: Of all exonic UGT variants, 98.9% and 97.4% were rare or very rare with global MAF <1% and <0.1%, respectively. Furthermore, 59.8% were singletons found only in one individual. **E**: Observed against expected missense-to-synonymous variant ratio indicates the mutational constraints of all human UGT genes. +Abbreviations: MAF: minor allele frequency. + ++ +
+ +**Ethnogeographic variability within the human UGT gene family. A**: Frequency distribution of the UGT1A1 promoter polymorphism across seven human populations. **B**: MAFs of putatively deleterious variants across all UGT genes. Only the variants that map to gene-specific exons are shown for each gene. Variants localized in shared exons of the UGT1A or UGT2A loci are shown as UGT1A[1−10] and UGT2A[1−2], respectively. **C**: The average number of deleterious UGT variants per individual is shown for different populations. The eight variants responsible for the largest contributions to overall functional variability are shown individually. + +## Data access +No raw data were generated in this project. All used data can be directly downloaded from the corresponding cited sources (gnomAD, Ensembl, ClinVar, AlphaMissense, and COSMIC). + +## Code availability +All code run in R/Bash required to perform the analyses and produce the figures and results can be found in the [code](code/) section where they are carefully described. + + +## Article citation +> González-Padilla, D., Camara, M. D., Lauschke, V. M., & Zhou, Y. (2024). Population-scale variability of the human UDP-glycosyltransferase gene family. Journal of Genetics and Genomics. doi: https://doi.org/10.1016/j.jgg.2024.06.018 + + + diff --git a/images/fig1.jpg b/images/fig1.jpg new file mode 100644 index 0000000..4cb6042 Binary files /dev/null and b/images/fig1.jpg differ diff --git a/images/fig4.jpg b/images/fig4.jpg new file mode 100644 index 0000000..718044d Binary files /dev/null and b/images/fig4.jpg differ