Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Extract contig duplication from the self alignment map #131

Open
chklopp opened this issue Nov 14, 2024 · 0 comments
Open

Extract contig duplication from the self alignment map #131

chklopp opened this issue Nov 14, 2024 · 0 comments

Comments

@chklopp
Copy link
Collaborator

chklopp commented Nov 14, 2024

Is your feature request related to a problem? Please describe.
d-genies enables to produce self alignment maps. From these maps it is possible to infer duplicated contigs or contig ends. The idea would be to have a button and a slider (for the minimum size of the duplicated area) and a button alone (fixed value) enabling to produce a list of duplicates and the fasta file of the contigs without duplication

Describe the solution you'd like
From the paf file find duplicated areas coordinates (chaining alignments), from the list of coordinates find the minimum set of contigs and contig parts to remove in order to remove duplication.

Describe alternatives you've considered
Use image analysis to find lines in the plots rather than using the paf file.
Having contig read coverage would improve the duplication discovery. (a file to provide to dgenies?)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant