Brigham Young University, Bio 465 - Capstone Project

Analyzing variance of splice variants in colorectal cancer from nanopore reads

Jonathan Dayton, Lucas Pinto, and PJ Tatlow

Data

We do not own the data we used for our analysis, and as such, we could not make the data publicly available for download. Please contact Dr. Marco Gerlinger for more information.

Statistical analysis

Code to run our statistical analysis, and the generated output files from that code, are found in the stats/ directory. We recommend RStudio for editing and running the .Rmd files.

Alignment

The first step is to align the data to a reference transcriptome and quantify each transcript. The scripts in the alignment directory are used to do this, producing SAM files, and in the case of Kallisto and BWA, producing quantifications. We did not quantify the output of Graphmap due to the issues discusses in the paper.

Data Quality

The scripts in the data_quality directory were used to assess the quality of the raw data and of the alignments.

fastqStats.py summarizes the average/standard deviations for length and number of reads of all FASTQ files.
getMAPQ.sh prints the MAPQ score of each alignment in the BWA-MEM filtered alignment file.
qualityScores.py prints the number of bases called with each quality score.
readLengths.py prints the length of each read from each FASTQ on it's own line.
PlotAlignmentQuality.R creates a simple histogram of the MAPQ files, which we used to determine the quality of the alignment (the output of getMAPQ.sh).
PlotQualityScores.R creates a histogram of the quality scores of each base (the output of qualityScores.py).
PlotReadLengths.R creates a histogram of the read lengths (the output of readLengths.py).

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
alignment		alignment
analysis		analysis
data_quality		data_quality
stats		stats
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Brigham Young University, Bio 465 - Capstone Project

Analyzing variance of splice variants in colorectal cancer from nanopore reads

Jonathan Dayton, Lucas Pinto, and PJ Tatlow

Data

Statistical analysis

Alignment

Data Quality

About

Releases

Packages

Contributors 2

Languages

License

pjtatlow/bio465-capstone

Folders and files

Latest commit

History

Repository files navigation

Brigham Young University, Bio 465 - Capstone Project

Analyzing variance of splice variants in colorectal cancer from nanopore reads

Jonathan Dayton, Lucas Pinto, and PJ Tatlow

Data

Statistical analysis

Alignment

Data Quality

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages