-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Re-write HIVSeqinR in python #13
Comments
Sorry to disturb you. Has the python version been completed?Thank you. |
We're switching to HIVIntact instead, @shengquan-Tang. It's written in Python, and we've contributed a couple of pull requests. |
Get it. Thank you for your reply. By the way, HIVIntact can distinguish intact and defective sequcences, but can't distinguish, such as, "LargeDeletion, nonHIV". Is it right? Thanks again! |
One of our pull requests adds the large deletion error, @shengquan-Tang. If you need that feature, you could use our fork of the project. |
I truly appreciate your assistance. |
Hi @shengquan-Tang, also here is the repository of our HIVIntact fork: https://github.com/cfe-lab/HIVIntact We have added almost all previous error codes (such as But there are still many differences between the two programs. |
Hi, @Donaim. Your response is really significant and thrilling to me. I'll learn how to use it. I appreciate your assistance. |
Closing this issue, because we're switching to HIVIntact. |
Superseeded by #10 |
Hi, @donkirkby. Today, I intended to utilize HIVIntact, however, I encountered difficulties in accessing it. It remains unclear whether your have altered the pathway or discontinued its availability due to other reasons. Thanks for your help! |
Hi @shengquan-Tang. The URL you linked to, is to the older, original version of HIVIntact. Please use the following repository: https://github.com/cfe-lab/CFEIntact . We will base our proviral pipeline on it. So the command should be Pathways have changed, sorry for inconveniences caused by that. |
Thank you @Donaim. Yes, I have also tried https://github.com/cfe-lab/CFEIntact, it can be successfully installed. But it will show some errors whether it's my data or your test data. I don't know what went wrong. |
Hi @shengquan-Tang . Sorry for this error, it's due to a difference in python version. I have fixed it today. Please clone the repository again, to get the newest version of CFEIntact (v1.18.2), and then install it again. Thank you for letting us know about this issue! |
Thanks @Donaim, I will try again! |
Hi, @Donaim. I ran the command "cfeintact check --subtype B test.fasta" which includes the HXB2 (K03455.1) reference sequence. However, I noticed that CFEIntact was unable to recognize it as an intact sequence. The HXB2 sequence was categorized under "error.csv" and had issues like "FrameshiftInOrf" and "DeletionInOrf" (as shown in the first picture). Interestingly, despite having 9719 matches (as seen in the second picture), this outcome seems quite unreasonable. |
Hi @shengquan-Tang ! Thank you for your feedback. Addressing the issues you've brought up: HXB2 intactnessThe orignal HXB2 sequence is indeed defective. When CFEIntact performs it analysis, it relies on a modified version of HXB2 that does not contain any known defects. No intact sequences detectedCFEIntact's default configuration is rather strict. ReliabilityLet me present some points that made us trust CFEIntact. We've done multiple rounds of validations as we transitioned from HIVSeqinR. CFEIntact is reliable and stable. Its calls are backed by evidence-based studies. It follows software best practices, including static analysis, continuous integration, high test coverage, and is written in a safe and well-established programming language Python. Still, if you find obviously wrong calls (like a discrepancy between the error code and the description in the documentation), do inform us and we'd be happy to investigate further. More issuesAs I have already mentioned, we benefit from user feedback. |
Thanks @Donaim . |
Now that we are automatically launching the proviral pipeline as part of MiCall, it would make sense to re-write HIVSeqinR in python, or at least re-structure the existing R code. This would make the code a lot more maintainable for us, and would allow us to diagnose bugs more easily. Since the launch of the pipeline, we have encountered 2 errors that led to the pipeline failing, see #12.
Alternatively, re-visit replacing HIVSeqinR by another tool, as outlined in #10.
The text was updated successfully, but these errors were encountered: