-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
what are the differences of different inovirus type in result file? #7
Comments
Hi,
Best, |
Realy thanks for your helpful and professional answers. Still, I have some questions. For answer 1 So the complete inovirus in the result file mean that all this contig is a inovirus and the complete inovirus in your paper mean circular inovirus and integrated prophages with canonical att sites. And How can I pick out the circular inovirus from the result file? Or how to identity circular inovirus technically? You did not mention it in the paper methods or supplyment document. For answer 3 As the pI marker of ALV1 has been included into the Final_marker_morph.hmm file, why did you include the other three proteins of ALV1 into the Marker_ALV1.faa file? What are the functions of them? Why can they serve as markers? If the reason is they are rare, it does not make sense. Because there are also many singletons in other inovirus. For answer 4 For tandem insertions, it is possible to treat the multiple pI protein independently. And tandem insertions could lead to clusters gathering multiple species. So if I want to cluster different inovirus into species, the tandem insertions should be processed separately as you mentioned in your paper, but you did not treat the potential multiple inovirus in tandem insertions independently, because it is hard to find the boundary of the multiple potential inovirus? Question: As you mentioned in supplyment document, sub-optimal genome assemblies yielding short contigs (i.e. < 5kb) will lead to a large amount of false negatives, as these short contigs will not include enough information to identify them as Thanks for your kindly reply! |
Extra question) I don't know that there is a way to manually inspect sequence < 5kb that would not be detected by the tool. You can certainly rule out a number of them because they will have bacterial genes that are unlikely to be encoded by an inovirus genome. But otherwise, you will have many cases where you will have only a pI-like protein and 1 or 2 hypothetical proteins around, and I don't think these are sufficient to determiner whether this short contig would be an inovirus or not. |
Thanks!!! Your suggestions are very helpful. Thanks very much! |
Hi, I have four more questions.
GCA_000847085.1_ViralProj14573_genomic_frag_X58839_1_2_annot_inovirus-predictions-refined.csv
GCA_000847085.1_ViralProj14573_genomic_frag_X58839_1_4_annot_inovirus-predictions-refined.csv
GCA_000847085.1_ViralProj14573_genomic_frag_X58839_1_7_annot_inovirus-predictions-refined.csv
GCA_000847085.1_ViralProj14573_genomic_frag_X58839_1_8_annot_inovirus-predictions-refined.csv
And this four inovirus region are the same. It may be related to the blast hits of Marker_ALV1.faa. So these four proteins in Marker_ALV1.faa are markers of pI?
Thanks for your kindly reply!
The text was updated successfully, but these errors were encountered: