Skip to content

NBISweden/tissue-adherent-VMB

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Distinct cervical tissue-adherent and luminal microbiome communities correlate with mucosal host gene expression and protein levels in Kenyan sex workers

General info

This repository contains the code related to the article "Distinct cervical tissue-adherent and luminal microbiome communities correlate with mucosal host gene expression and protein levels in Kenyan sex workers" Edfeldt, Kaldhusdal et al (2023)

This project used multiple datasets:

  • Transcriptomics data (bulk mRNA-SEQ)
  • Microbiota data (16S)
  • Protein data (bead-based affinity assay)
  • Cytokine data

The analysis can be replicated by installing conda and running the individual rmarkdown scripts.
The WORKFLOW.md file describes the input and output of each rmarkdown script.
Raw counts for the transcriptomics data is downloaded from GEO and the raw counts for the microbiota is downloaded from supplement files in the rmarkdown script named 03_normalize_data.Rmd. Normalized MFI protein values and the cytokine values are downloaded in Figure_06.Rmd.

Data Availability Statement

Microbiome sequencing data for this study has been deposited in the European Nucleotide Archive (ENA) at EMBL-EBI under accession number PRJEB50325. The raw counts table with taxonomic annotation can be found in Suppl. Table 1

Transcriptomics count data can be accessed in the Gene Expression Omnibus public repository, SuperSeries ID GSE217237. The raw transcriptomic sequencing data cannot be held in a public repository due to the sensitive nature of such personal data. Request for data access can be made to the Karolinska Institutet Research Data Office (contact via [email protected]), and access will be granted if the request meets the requirements of the data policy.

Protein data is previously published in doi: 10.1371/journal.ppat.1010494
Cytokine data Can be found in Suppl. Table 14
Sociodemographic and clinical characteristics Can be found in Suppl. Table 2

Supplementary material can be downloaded here

Dependencies

Project is created with:
* R version: 3.6.2
* RStudio version: 2022.07.1+554

Setup Conda

  1. Clone the repo
  2. If not already installed download mini conda/conda
  3. In the terminal navigate to the project directory
  4. create a new environment:
    conda env create -n tissue_adherent_VMB -f ./resources/environment_tissue_adherent_VMB.yml
  5. Activate the environment:
    conda activate tissue_adherent_VMB
  6. Open your preferred text editor from terminal by :
    /path/to/rstudio &
  7. Install niceRplots package from github:
    remotes::install_github("czarnewski/niceRplots",force=T)

Repo structure

project
│   README.md
│  
└───code
│   │   differential_abundance.R
│   │   enrichment_function.R 
│   │   knit_function.R
│   │   Picrust_pipe.sh
└───data
│ 
└───reports
│   │   00_Preprocessing.Rmd
│   │   02_Analysis.Rmd
│   │   ...
│   └───manuscript
│       │   Figure01.Rmd
│       │   Figure02.Rmd
│       │   ...
│       └───Figures
│           │  Figure 1.jpeg 
│           │   ...
└───resources
│   └───KEGG_GO_database
│   └───OptiVag_BVAB_database
│   │   ...
│ 

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published