-
Notifications
You must be signed in to change notification settings - Fork 28
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Disagreement between Intervene and BEDtools #62
Comments
I want to add that there's also difference in results depending on what bed file you put first. |
Is there anybody can answer this question? |
Hi @lovelycatZ @Sarsaparella thanks for your interest and posting this. This is a known issue when doing genomic interval intersections. This is because, for genomic regions, you will not always have a one-to-one overlap. That's why you slight difference when changing the file order: For example, if you've three BED files (a, b and c) and when you run the
This is also explained well enough here by the pybedtools developer as posted by @amizeranschi #27 daler/pybedtools#45 (comment) Also, Intervene is using I hope this helps. |
Hi @asntech Please consider implementing Implementing You would just need to use this function from which is a wrapper for |
Hello,
Thank you for providing such a convenient tool!
I have installed Intervene via conda and tested with my own dataset
hap1.bed
andhap2.bed
to generate a Venn diagram using command lineintervene venn -i hap*.bed --names hap1,hap2 --save-overlaps
. Finally, in the Intervene_result folder, there were three files 1)01_hap2.bed; 2)10_hap1.bed; 3)11_hap1_hap2.bed. For the third file 11_hap1_hap2.bed, it contained 5206 intervals same with the intersection part of the Venn diagram that were found in both A and B, right? But it was strange that when I use BEDtools with command linebedtools intersect -a hap1.bed -b hap2.bed
, there were 5773 records in the result file.How to explain the the difference?
The text was updated successfully, but these errors were encountered: