Some elements have more reads in unique mode vs multi mode #44

Open
R-Najjar opened this issue May 1, 2024 · 6 comments
Labels
input quirks Issue caused by quirks in input file

Comments

@R-Najjar
R-Najjar commented May 1, 2024

Hi,
I ran TElocal in multi and uniq modes to get an idea of how much read support was coming from unique vs. multimapping reads, and I found some elements with more reads in uniq mode than in multi mode. How is this possible? In other words, why weren't these unique reads counted in multi mode, which should count all reads, correct? This happened in 47 out of 657 elements that I was studying across 70 samples. The differences ranged from 1 to 4 reads.

Thanks,
Rayan

@olivertam
Member

Hi,

Could you provide the command lines that you used for TElocal?
Could you also describe how you aligned the reads?

Thanks.

@olivertam olivertam added the input quirks Issue caused by quirks in input file label May 1, 2024
@R-Najjar
Author

R-Najjar commented May 1, 2024

Hi Oliver,
Thank you. I used STAR for alignment:

STAR --runThreadN 24 --genomeDir /t2t/nw/star --runMode alignReads --readFilesIn ${SAMP}.1.fastq.gz ${SAMP}.2.fastq.gz --readFilesCommand zcat --outFileNamePrefix /t2t/nw/bams/${SAMP} --outFilterMultimapNmax 100 --winAnchorMultimapNmax 100 --twopassMode Basic --outSAMtype BAM Unsorted

And here are the two TElocal runs:

apptainer exec --bind /gscratch telocal.sif TElocal --mode uniq --stranded reverse --project /t2t/nw/sines/local_uniq_${SAMP} -b ${SAMP}Aligned.out.bam --GTF /t2t/chm13v2.0_RefSeq.gtf --TE /t2t/T2T-CHM13v2_rmsk_TE.gtf.locInd

apptainer exec --bind /gscratch telocal.sif TElocal --mode multi --stranded reverse --project /t2t/nw/sines/local_multi_${SAMP} -b ${SAMP}Aligned.out.bam --GTF /t2t/chm13v2.0_RefSeq.gtf --TE /t2t/T2T-CHM13v2_rmsk_TE.gtf.locInd
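
For anyone who wants to check their own data for the same pattern, the comparison between the two runs can be sketched as below. This is only a minimal sketch, not part of TElocal: it assumes each run produced a tab-delimited count table with a header row, the element name in the first column, and the count in the second (the file names in the usage note are assumptions based on the `--project` prefixes above). The helper names `read_counts` and `uniq_exceeds_multi` are made up for illustration.

```python
import csv


def read_counts(handle):
    """Read a two-column, tab-delimited count table into {element: count}.

    Assumes one header row, then rows of <element name><TAB><count>.
    Counts are parsed as floats so that non-integer values do not fail.
    """
    reader = csv.reader(handle, delimiter="\t")
    next(reader, None)  # skip the header row
    counts = {}
    for row in reader:
        if len(row) < 2:
            continue  # ignore blank or malformed lines
        counts[row[0]] = float(row[1])
    return counts


def uniq_exceeds_multi(uniq, multi):
    """Return (element, uniq_count, multi_count) for elements where uniq > multi.

    Elements missing from the multi table are treated as having 0 reads.
    Results are sorted by the largest difference first, then by name.
    """
    hits = [
        (name, u, multi.get(name, 0.0))
        for name, u in uniq.items()
        if u > multi.get(name, 0.0)
    ]
    return sorted(hits, key=lambda h: (-(h[1] - h[2]), h[0]))
```

A possible way to use it, assuming the two tables are named `local_uniq_SAMPLE.cntTable` and `local_multi_SAMPLE.cntTable`: open each file, pass the handles to `read_counts`, and print what `uniq_exceeds_multi` returns. Any element it lists has more reads in uniq mode than in multi mode.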

@olivertam
Member

Hi,

I was able to reproduce your issue.
We're taking a closer look at it.

Thanks.

@R-Najjar
Author

R-Najjar commented May 4, 2024

I appreciate it. Please let me know what you find. I checked whether the same thing happens in TEtranscripts, and it does. It's not common, but it does happen: I found 27 occurrences in 21 elements, with differences between unique and multi reads ranging from 1 to 9.
uniq.more.than.multi.csv

Thanks

@R-Najjar
Author

Hi Oliver, do you have an update on this? I'd appreciate it.
Thank you

@olivertam
Member

Hi,

We are still pinpointing the possible source.

Thanks.
