N50 etc. are not calculated #34
Comments
Hi, Thank you for bringing this to our attention. This does seem like it could be a bug, but I'm having trouble identifying the root cause at the moment. Could you please share the full error message along with the exact command you ran? If possible, it would also be helpful if you could run Binette with the Thank you!
Thank you for your quick response. The following is the output in debug mode. We suspected that errors were occurring in specific bins when calculating N50, completeness and contamination, so we changed the source code so that it could sort even with missing values. With that change, we found that only two of the bins were not getting these scores. Strangely, if we input only the bins that cause the problem, no error occurs. (In this analysis we used 6 bins, and we also tried some combinations of 2 bins for debugging, but the same error occurred in all cases.) I am sorry that I cannot provide the actual data, but I will provide as much information as possible.
0%| | 0/31643 [00:00<?, ?it/s]
0%| | 0/31643 [00:00<?, ?contig/s]
0%| | 0/31643 [00:00<?, ?contig/s]
Hi, Thanks for providing the additional information! How many threads are you allocating to Binette? The scoring step uses multiprocessing, so something unexpected might be happening. Could you try rerunning the analysis with just one thread ( When you mention:
Are you referring to 6 individual bins, or 6 bin sets? Also, it looks like the logs you shared were generated without the
Unfortunately, the same error occurs even when threads is set to 1. Also, I sent you a log that was not in debug mode. I apologise for this.
[2024-12-17 22:11:56] DEBUG - Writing metrics for bin set 'concoct' to file: path/to/input_bins_2.concoct.tsv
I was able to reproduce the error on my side, so I will be able to dig into it more easily.
It is great that we have made so much progress in resolving the problem. The versions I have tried are 1.0.3 and 1.0.4, so the solution you presented is likely to work! I look forward to doing the analysis with this great tool, which accepts multiple bins as input and runs at such high speed!
The problem occurred because two input bins from different sets shared the same contig content, which is not unexpected. While the bins were correctly dereplicated during scoring, Binette used the non-dereplicated bins when writing the input bins per set. These included unscored bins, which caused the error. I made a fix in PR #36 and will release a new version asap. Thanks again for reporting this bug!
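To illustrate the mechanism described above, here is a minimal sketch, not Binette's actual code. The `Bin` class, the `dereplicate` and `scored_bins_for_set` helpers, the `metabat` set name, the contig names and the placeholder scores are all assumptions made up for the example. It shows how scoring only one representative per contig set leaves the duplicate object unscored, and how looking up the scored representative when writing per-set output avoids that.

```python
from dataclasses import dataclass
from typing import Dict, FrozenSet, List, Optional


@dataclass
class Bin:
    # Hypothetical stand-in for a bin object, not Binette's class.
    origin: str
    contigs: FrozenSet[str]
    score: Optional[float] = None


def dereplicate(bin_sets: Dict[str, List[Bin]]) -> Dict[FrozenSet[str], Bin]:
    """Keep the first bin seen for each distinct contig set."""
    unique: Dict[FrozenSet[str], Bin] = {}
    for bins in bin_sets.values():
        for b in bins:
            unique.setdefault(b.contigs, b)
    return unique


def scored_bins_for_set(bins: List[Bin], unique: Dict[FrozenSet[str], Bin]) -> List[Bin]:
    """Map each input bin to its scored representative."""
    return [unique[b.contigs] for b in bins]


# Two sets share a bin with identical contig content (c1, c2).
bin_sets = {
    "concoct": [Bin("concoct", frozenset({"c1", "c2"}))],
    "metabat": [
        Bin("metabat", frozenset({"c1", "c2"})),
        Bin("metabat", frozenset({"c3"})),
    ],
}

unique = dereplicate(bin_sets)
for i, b in enumerate(unique.values()):
    b.score = 90.0 - i  # placeholder score, only representatives are scored

# Iterating over the original (non-dereplicated) bins exposes an unscored duplicate.
unscored = [b for bins in bin_sets.values() for b in bins if b.score is None]

# Looking up the representative instead yields only scored bins.
fixed = scored_bins_for_set(bin_sets["metabat"], unique)
```

In this sketch the duplicate `metabat` bin never receives a score, which is the kind of `None` value that later breaks sorting.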
Thank you for your help! When the revised version is released, I will test it with my data too!
Thank you for updating the software. Thank you for fixing it so quickly this time!
Thank you for developing such a useful tool.
I am having trouble with the following error, which only occurs in a specific bin.
Due to the terms of the agreement, we cannot send the bin, but this is occurring with multiple bin files, so it is not thought to be a problem caused by the binner.
Also, if you run these bin files through checkm2, they will run normally and we will be able to get all the scores. Therefore, it is also unlikely that the problem is due to missing files.
Based on this, I suspect that it is a bug in binette that only occurs under certain conditions.
If you have any solutions or things to check, please let me know.
Thank you.
io_manager.py", line 139, in write_bin_info
for bin_obj in sorted(bins, key=lambda x: (x.score, x.N50, -x.id), reverse=True):
TypeError: '>' not supported between instances of 'float' and 'NoneType'
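The traceback can be reproduced outside Binette with a small sketch. Everything here is hypothetical: the `Bin` dataclass only mimics the three attributes used in the sort key, the values are invented, and `safe_sort_key` is one possible workaround for sorting with missing values, not the fix that was applied upstream.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass
class Bin:
    # Hypothetical stand-in exposing the attributes used in the sort key.
    id: int
    score: Optional[float] = None
    N50: Optional[int] = None


def safe_sort_key(b: Bin) -> Tuple[float, float, int]:
    """Replace missing values with -inf so unscored bins sort last with reverse=True."""
    score = b.score if b.score is not None else float("-inf")
    n50 = b.N50 if b.N50 is not None else float("-inf")
    return (score, n50, -b.id)


bins = [Bin(1, 85.0, 12000), Bin(2), Bin(3, 92.5, 30000)]

# The same key as in the traceback fails once one bin has no score.
try:
    sorted(bins, key=lambda x: (x.score, x.N50, -x.id), reverse=True)
    raised = False
except TypeError:
    raised = True

# The None-tolerant key sorts all bins and puts the unscored one at the end.
ordered_ids = [b.id for b in sorted(bins, key=safe_sort_key, reverse=True)]
print(raised, ordered_ids)
```

A key like this only hides the symptom. It is useful for finding which bins lack scores, but the unscored bins still need to be explained.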