Bulk RNAseq workshop

Repository for bulk RNAseq course of the Danish Health Data Science Sandbox project.

This course is an introduction to how to approach bulk RNAseq data, starting from the sequencing reads. It will provide an overview of the fundamentals of RNAseq analysis, including read preprocessing, data normalization, data exploration with PCAs and heatmaps, performing differential expression analysis, and annotation of the differentially expressed genes. Participants will also learn how to evaluate confounding and batch effects in the data. The course will further touch upon laboratory protocols, library preparation, and experimental design of RNA sequencing experiments, especially about how they influence downstream bioinformatic analysis.

This workshop is based on the materials developed by members of the teaching team at the Harvard Chan Bioinformatics Core (HBC), a collection of modified tutorials from the DESeq2, R language vignettes and the nf-core pipeline for bulk RNAseq.

Goals

By the end of this workshop, you should be able to analyse your own bulk RNAseq count matrix:

Preprocess your reads.
Normalize your data.
Explore your samples with PCAs and heatmaps.
Perform Differential Expression Analysis.
Annotate your results.

Syllabus

Introduction to bulk-RNASeq
Experimental planning
Intro to the data
Preprocessing your reads
RNAseq data
Exploratory analysis
Differential Expression Analysis
Functional Analysis
Summarized workflow

Workshop requirements

Knowledge of R, Rstudio, and Rmarkdown. It is recommended that you have at least followed our workshop R basics
Basic knowledge of RNAseq technology
Basic knowledge of data science and statistics such as PCA, clustering and statistical testing

Intended use

The aim of this repository is to run a comprehensive but introductory workshop on bulk-RNAseq bioinformatic analyses. Each of the modules of this workshop is accompanied by a powerpoint slideshow explaining the steps and the theory behind a typical bioinformatics analysis (ideally with a teacher). Many of the slides are annotated with extra information and/or point to original sources for additional reading material.

A version of the slides from 2024 can be found in this zenodo repository.

Acknowledgements

Center for Health Data Science, University of Copenhagen.
Hugo Tavares, Bioinformatics Training Facility, University of Cambridge.
Silvia Raineri, Center for Stem Cell Medicine (reNew), University of Copenhagen.
Harvard Chan Bioinformatics Core (HBC), check out their github repo
Adrija Kalvisa
nf-core community

Name		Name	Last commit message	Last commit date
Latest commit History 142 Commits
.github/workflows		.github/workflows
Notebooks		Notebooks
Scripts		Scripts
.gitignore		.gitignore
LICENSE.md		LICENSE.md
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Bulk RNAseq workshop

Goals

Syllabus

Workshop requirements

Intended use

Acknowledgements

About

Releases 10

Packages

Contributors 5

Languages

License

hds-sandbox/bulk_RNAseq_course

Folders and files

Latest commit

History

Repository files navigation

Bulk RNAseq workshop

Goals

Syllabus

Workshop requirements

Intended use

Acknowledgements

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 10

Packages 0

Contributors 5

Languages

Packages