Skip to content

Python package to find chimeras in CRAC/CLASH and HITS-CLIP datasets

License

Notifications You must be signed in to change notification settings

kashyapchhatbar/CLASHChimeras

Repository files navigation

CLASHChimeras is a Python package for analysing CLASH datasets. It takes raw fastq files as input and provides comprehensive analysis of RNA profiles and chimeric reads identification. The output is CSV and BED format files for easy visualization in Genome Browsers.

CLASHChimeras requires certain software to be installed and setup before you can use it completely. The software you need to explicitly install are the following:

  • Bowtie2 - Fast and sensitive read alignment
  • Tophat - A spliced read mapper for RNA-Seq

The package can be used by three executable scripts

  1. download-for-chimeras
  2. align-for-chimeras
  3. find-chimeras
$ download-for-chimeras -gor "H.sapiens" -mor hsa

Downloads required sequences and create bowtie2 indexes required for alignment

$ align-for-chimeras -i input.fastq -si /path/to/database -ti
/path/to/database
 -o output

Given a fastq file, this script executes bowtie2 and tophat aligners to generate alignment files necessary for detecting chimeras in the reads

$ find-chimeras -s smallRNA.sam -t targetRNA.sam -o output

Given two SAM files, this script tries to find chimeras that are observed between a smallRNA and a targetRNA

Check out the documentation for installation instructions and detailed usage with example

About

Python package to find chimeras in CRAC/CLASH and HITS-CLIP datasets

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages