-
Notifications
You must be signed in to change notification settings - Fork 5
Project 3
Nicolas edited this page Oct 22, 2021
·
25 revisions
Many cancers are at least partly initiated by the expression of aberrant transcript isoforms. RNA-seq is often analyzed using known annotated isoforms but tumors can also express novel isoforms that can only detected using dedicated tools.
- simple fastq file, available at https://github.com/IARCbioinfo/data_test
- basic bash scripting (running a software in command line)
- nextflow programming (good understanding of practical 1)
- download the https://github.com/IARCbioinfo/data_test
- download and test software SUPPA2 (see documentation)
- write a nextflow pipeline, following the IARCbioinfo template (IARCbioinfo/template-nf). The script would require (1) a process that trims reads, (2) a process to run RNA-seq quantification with salmon; (2) a process that runs SUPPA2.
- Optional, if time left: running the script on an actual cancer case from SRA
- coding in Nextflow
Tips:
- look at examples of nextflow pipelines (e.g., IARCbioinfo/fastqc-nf), possibly using them as a starting point instead of doing it from scratch
[email protected] (Nicolas Alcala)