Hi there, my github account did not notify me when there are issue. So if you are in a hurry, you can email me. [email protected]. I check email every day.
You can install from this GitHub repository for latest (often development) version by following command line
pip install -U git+https://github.com/StatBiomed/CamoTSS
In either case, add --user
if you don't have the write permission for your
Python environment.
You can download test file from figshare.
Here, you can download some large file include genome.fa, possorted_genome_bam_filtered.bam.
Here are three modes in CamoTSS : TC+CTSS , TC and CTSS.
When you run TC+CTSS mode, you will get TC result and then get the CTSS result based on the TC.
When you run TC mode, you will only get the TC result.
The TC+CTSS and TC mode have the same required files.
The --outdir is the only required parameter for CTSS mode. But the outdir should include output of TC.
If you want to run CTSS mode, you must based on the output of TC.
You can run CamoTSS TC+CTSS mode by using test file according to the following code.
Note You should use the same reference gtf file and reference fasta file as that you used during alignment. In other words, if you run alignment by using cellranger, then the gtf file and fasta file should located in the refdata-gex-GRCh38-2020-A/fasta/genome.fa and refdata-gex-GRCh38-2020-A/genes/genes.gtf.
For the remaining modes, you can check this document.
#!/bin/bash
gtfFile=$download/Homo_sapiens.GRCh38.105.chr_test.gtf
fastaFile=$download/genome.fa
bamFile=$download/possorted_genome_bam_filtered.bam
cellbarcodeFile=$download/cellbarcode_to_CamoTSS
CamoTSS --gtf $gtfFile --refFasta $fastaFile --bam $bamFile -c $cellbarcodeFile -o CamoTSS_out --mode TC+CTSS
To identify alternative TSS usage or alternative CTSS usage, Brie2 (Huang & Sanguinetti, 2021) is recommend to be used.
Here, we provide an example exploiting BRIE2 to detect alterntive TSS/CTSS usage.
You can check it in our manual.
The full manual is here, including:
Hou, R., Hon, CC. & Huang, Y. CamoTSS: analysis of alternative transcription start sites for cellular phenotypes and regulatory patterns from 5' scRNA-seq data. Nat Commun 14, 7240 (2023). https://doi.org/10.1038/s41467-023-42636-1