Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

The meaning and impact of genes with 'TaxID' of 0 in the HGTector2 prediction result #131

Open
chenhuag opened this issue Nov 9, 2023 · 4 comments

Comments

@chenhuag
Copy link

chenhuag commented Nov 9, 2023

I used HGTector2 to predict HGTs in my genome, and it classified my genome as "phylum Proteobacteria". The prediction results showed that 318 genes were identified as HGTs, but only 40 of them had a 'TaxID' and were eukaryotes and archaea. The 'TaxID' for 278 genes was 0. Can you help me understand the meaning of this result and the impact on the analysis?

@qiyunzhu
Copy link
Contributor

qiyunzhu commented Nov 9, 2023

Hello @chenhuag Thanks for your interest in this program! 318 genes being identified as HGT-derived means that they have an atypical homology search pattern which is likely attributed to HGT. 278 genes having the potential donor identified as TaxID 0 meaning that the potential donor cannot be identified. Since you were examining a very deep taxonomic level (phylum), it could be that the donor information was already lost or attenuated throughout the history of evolution. Hope it helps!

@chenhuag
Copy link
Author

Thank you for your answer. I check the log file and I get a warning message "WARNING: Cannot cluster distal group using KDE. Use fixed threshold 25 instead". Is this related to this result?

@qiyunzhu
Copy link
Contributor

Your understanding is correct.

@hhj00123
Copy link

7e9ff44c20912403657cceda8a705c1 Hi,I have a question:when I set up the self- and close- taxa, I found a large number of HGT events were annotated with the source as N/A. Can I assume that a significant portion of these HGT events are very similar to the close- or self- groups we defined, making it difficult to determine their origin?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants