Introduction to Peak Analysis

Learning Objectives

Describe peak data and the different file formats generated by peak calling algorithms
Evaluate the metrics used to assess the quality of peak calls
Compare peak calls across samples within a dataset
Create visualizations to evaluate peak annotations
Evaluate differentially enriched regions between two sample groups

Installations

On your desktop

R
RStudio
Integrative Genomics Viewer (IGV)
The listed R packages

On your HPCC (if not using Harvard's O2 cluster)

Required

Nextflow version 24.11.0-edge

Alternative to Nextflow

samtools version 1.15.1
bedtools version 2.30.0
Picard version 2.27.5
phantompeakqualtools version 1.2.2
deepTools version 3.5.6
bedGraphToBigWig version 302.1

NOTE: This workshop was designed with the software versions listed above. If you are not working on the O2 cluster and have different versions of these tools installed, the provided commands may still work, but you may need to adjust some of them.
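If you are unsure which versions are installed on your cluster, a quick check against the list above can save debugging time later. The following is a minimal sketch, not part of the workshop materials. It assumes that `samtools`, `bedtools`, and `deeptools` are on your `PATH` and that each prints its version when called with `--version`; the function names are hypothetical, and the other tools in the list are left out because how they report their version depends on how they were installed.

```python
import re
import shutil
import subprocess

# Versions copied from the installation list above.
EXPECTED_VERSIONS = {
    "samtools": "1.15.1",
    "bedtools": "2.30.0",
    "deeptools": "3.5.6",
}

# Assumption: each tool is on your PATH and prints its version with --version.
VERSION_COMMANDS = {
    "samtools": ["samtools", "--version"],
    "bedtools": ["bedtools", "--version"],
    "deeptools": ["deeptools", "--version"],
}


def extract_version(text):
    """Return the first dotted version number found in text, or None."""
    match = re.search(r"\d+(?:\.\d+)+", text)
    return match.group(0) if match else None


def check_tool(name):
    """Return 'ok', 'not found', 'unknown', or 'differs (found X)' for one tool."""
    command = VERSION_COMMANDS[name]
    if shutil.which(command[0]) is None:
        return "not found"
    try:
        result = subprocess.run(command, capture_output=True, text=True, timeout=30)
    except (OSError, subprocess.TimeoutExpired):
        return "unknown"
    # Some tools print their version to stderr, so search both streams.
    found = extract_version(result.stdout + result.stderr)
    if found is None:
        return "unknown"
    if found == EXPECTED_VERSIONS[name]:
        return "ok"
    return f"differs (found {found})"


def check_all():
    """Check every tool in EXPECTED_VERSIONS and return a name -> status mapping."""
    return {name: check_tool(name) for name in EXPECTED_VERSIONS}


if __name__ == "__main__":
    for name, status in check_all().items():
        print(f"{name} (expected {EXPECTED_VERSIONS[name]}): {status}")
```

A status of "differs" does not necessarily mean the workshop commands will fail; it only flags where you may need to adjust them.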

Lessons

Workflow overview: From sequenced reads to peaks
Existing workflows for ChIP-seq analysis
Understanding peaks and peak file formats
Assessing peak quality metrics
Assessing sample similarity and identifying potential outliers
Concordance across replicates using peak overlaps
Peak annotation and visualization using ChIPseeker
Differential enrichment analysis using DiffBind
Peak visualization using IGV
Annotation and functional analysis of DE regions
Motif analysis/discovery

NOTE: If you aren't working on Harvard's O2 cluster, the directory structure on your HPCC is likely different, and you will need to modify the paths in the lessons to match it.

Answer key

Day 1 exercises
Day 2 exercises

These materials have been developed by members of the teaching team at the Harvard Chan Bioinformatics Core (HBC). These are open access materials distributed under the terms of the Creative Commons Attribution license (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
