-
Notifications
You must be signed in to change notification settings - Fork 8
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Resistance gene not assembled in primary contig file, but is present in alternate contig file #30
Comments
In your case, please use both primary and alt contig files, i.e. As for why ARGs are in the alt, my guess is that there were haplotypes without the ARGs and perhaps with higher coverage, so graph cleaning dropped the ARGs. You can pick a couple of reads from an ARG contig and grep them in the r_utg graph to check. |
Our samples are fecal sample metagenomes with only haploid bacterial DNA, so haplotypes are not relevant in our case. We found the ARG sequence within a 554 kb segment in the r_utg.gfa file. Out of that 554 Kb segment there is ~370Kb of its sequence not in the p_ctg.gfa file, and the full 554 kb is in the a_ctg.gfa file. Why is this sequence left out of the .p_ctg.gfa file? |
Sorry for my confusing comment, meant to say "there were
Is that a unitig? (And it is divided up in the contig graphs?) What are the coverages of the segment and contigs? Coverage is the If you don't mind sharing the {r_utg, p_ctg, a_ctg}.gfa files via email, I can have a look. |
Thank you for the response! I have emailed you the requested files.
The "554 kb segment in the r_utg.gfa file" I am referring to appears
to be a unitig as it is identified with "utg" in the beginning. The
coverage is dp:f:11. The 554kb segment in the r_utg.gfa does not
appear to be divided up in the contig graph based on an end-to-end
alignment to an a_ctg segment. More details about coordinates are in
the email I sent.
…On Wed, Nov 8, 2023 at 3:45 PM xfengnefx ***@***.***> wrote:
with only haploid bacterial DNA
Sorry for my confusing comment, meant to say "there were haplotypes closely related strains [with and] without the ARGs". For genomes that have less than 1% whole genome diversity, hifiasm-meta currently usually will not separate them.
a 554 kb segment in the r_utg.gfa file
Is that a unitig? (And it is divided up in the contig graphs?) What are the coverages of the segment and contigs? Coverage is the dp:f field of S lines in the gfa files. Bandage will show it on the right side when you select a node, too.
If you don't mind sharing the {r_utg, p_ctg, a_ctg}.gfa files via email, I can have a look.
—
Reply to this email directly, view it on GitHub, or unsubscribe.
You are receiving this because you authored the thread.Message ID: ***@***.***>
|
Got the mail, thanks! Will check tonight. |
In a comparison of metagenomic assemblies made only from illumina short-read data (metaSPADES) to hifiasm-meta assemblies (the primary contig file, .p_ctg.gfa), we found an ARG assembled in a metaSPADES assembly that was not present in the hifiasm-meta assembly for that sample. ~20 HiFi reads align well to the ARG, but the ARG is not assembled with hifiasm-meta. However, the ARG is assembled in the alternate contig file made by hifiasm-meta (.a_ctg.gfa), along with the .r_ctg.gfa and .p_utg.gfa files. How would you recommend we run hifiasm-meta such that the ARG content we care about lies solely within the primary contig file, or should we find ARGs within both primary and alternate contig files? ARGs are usually surrounded by mobile genetic elements and usually belong to low-abundance species.
The text was updated successfully, but these errors were encountered: