Skip to content

Commit

Permalink
Improve main README
Browse files Browse the repository at this point in the history
  • Loading branch information
daianna21 committed Aug 28, 2024
1 parent ff7f32d commit da8b486
Show file tree
Hide file tree
Showing 5 changed files with 513 additions and 2 deletions.
Binary file modified .DS_Store
Binary file not shown.
481 changes: 481 additions & 0 deletions README.html

Large diffs are not rendered by default.

34 changes: 32 additions & 2 deletions README.md
Original file line number Diff line number Diff line change
@@ -1,2 +1,32 @@
# UGT_genetic_profiling_KI
This repository was created to maintain version control of the pharmacogenomics KI project that aims to analyze genetic variants in UGT genes across different human populations.
# Population-scale variability of the human UDP-glycosyltransferase gene family
This repository was created to maintain version control of this pharmacogenomics project at Karolinska Institutet (KI) under the supervision of Volker M. Lauschke and Yitian Zhou.


## Abstract
Human UDP-glycosyltransferases (UGTs) are responsible for the glycosylation of a wide variety of endogenous substrates and commonly prescribed drugs. Different genetic polymorphisms in UGT genes are implicated in interindividual differences in drug response and cancer risk. However, the genetic complexity beyond these variants has not been comprehensively assessed. We here leveraged whole-exome and whole-genome sequencing data from 141,456 unrelated individuals across 7 major human populations to provide a comprehensive profile of genetic variability across the human UGT gene family. Overall, 9666 exonic variants were observed of which 98.9% were rare. To interpret the functional impact of UGT missense variants, we developed a gene family-specific variant effect predictor. This algorithm identified a total of 1208 deleterious variants, most of which were found in African and South Asian populations. Structural analysis corroborated the predicted effects for multiple variations in substrate binding sites. Combined, our analyses provide a systematic overview of UGT variability, which can yield insights into interindividual differences in phase 2 metabolism and facilitate the translation of sequencing data into personalized predictions of UGT substrate disposition.

<p align="center">
<img src= "images/fig1.jpg" width="800" >
</p>

**Global landscape of human UGT variability. A**: Schematic representation of the canonical transcripts of the UGT1A and UGT2A loci, including gene-specific and shared exons. **B**: The total number of variants across the four human UGT gene families is shown. **C**: The distribution of variant types is shown for all exonic variants. **D**: Of all exonic UGT variants, 98.9% and 97.4% were rare or very rare with global MAF <1% and <0.1%, respectively. Furthermore, 59.8% were singletons found only in one individual. **E**: Observed against expected missense-to-synonymous variant ratio indicates the mutational constraints of all human UGT genes.
Abbreviations: MAF: minor allele frequency.

<p align="center">
<img src= "images/fig4.jpg" width="800" >
</p>

**Ethnogeographic variability within the human UGT gene family. A**: Frequency distribution of the UGT1A1 promoter polymorphism across seven human populations. **B**: MAFs of putatively deleterious variants across all UGT genes. Only the variants that map to gene-specific exons are shown for each gene. Variants localized in shared exons of the UGT1A or UGT2A loci are shown as UGT1A[1−10] and UGT2A[1−2], respectively. **C**: The average number of deleterious UGT variants per individual is shown for different populations. The eight variants responsible for the largest contributions to overall functional variability are shown individually.

## Data access
No raw data were generated in this project. All used data can be directly downloaded from the corresponding cited sources (gnomAD, Ensembl, ClinVar, AlphaMissense, and COSMIC).

## Code availability
All code run in R/Bash required to perform the analyses and produce the figures and results can be found in the [code](code/) section where they are carefully described.


## Article citation
> González-Padilla, D., Camara, M. D., Lauschke, V. M., & Zhou, Y. (2024). Population-scale variability of the human UDP-glycosyltransferase gene family. Journal of Genetics and Genomics. doi: https://doi.org/10.1016/j.jgg.2024.06.018


Binary file added images/fig1.jpg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added images/fig4.jpg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.

0 comments on commit da8b486

Please sign in to comment.