-
Notifications
You must be signed in to change notification settings - Fork 12
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Error - ValueError: could not convert string to float: 'EMPTY' #32
Comments
Hi Joy, Sorry about this issue. I think I have an idea of what is causing it though. Would you be able to share the final output .csv files that were generated, along with the .depth file? That would help me narrow this down. Thanks, |
Hi Arkadiy, Best, |
Hi Joy, Thanks for sharing that. That's helpful. Could you please also please provide this file: "Bam_file_for_fegenie.txt". Thanks, |
Actually, is it fair to assume that in your provided "Bam_file_for_fegenie.txt", the first column of each row contains the genome/bin name that does not include the '.fasta' extension? If so, could you please try re-running with the .fasta extension included in the bin names? FeGenie expects the first column of that file to contain bin names exactly as they appear in the second column of the "FeGenie-geneSummary.csv" file. Does that make sense? |
Your suggestion worked! I got a .tiff dotplot after amending the Bam_file_for_fegenie.txt file to include the fasta extensions. Would there be more than one dotplot, though? One for each sample (metatranscriptome)? Thank you! |
Hey Joy, Great! Glad to hear that including the .fasta extensions worked. I will edit the wiki/readme to make this input more clear for users. Currently, FeGenie creates a single dotplot that is based on the total average coverage (third column in the generated depth files). However, I just added a flag to the program Thanks, |
Thanks for this! I reinstalled FeGenie with conda and ran the following: Even though I used "--skip", it still was finding ORFs....anyway, it errored out: Finding ORFs for Desulfosarcina_stnAB.fasta Am I supposed to run this in the directory with the .depth files? |
Ah, shoot - stupid bug. Just fixed it and pushed the changes to the GitHub repo. It shouldn't be finding ORFs if you provide the same output directory as you did in your initial run. Are you by chance providing a new directory name to correspond to the specific
Also, i just remembered that I need to specify a new release of FeGenie, and then the bioconda bot will push the new changes to the FeGenie recipe in bioconda. And that actually takes a few days. In other words, all the changes that I made to FeGenie today will not show up on your end if you just redo the conda install. You'd have to download the code from GitHub via:
I just addded the setup-conda-env.sh file to the repository, and it should create a FeGenie-compatible conda environment for you, using the updated code that is in this repository right now. Does this make sense? Sorry if this is all confusing...I'm still learning about all this conda and github stuff myself. Thanks, |
Hi Arkadiy, A for loop was exactly my next step after an initial run to make sure I understood that it worked! Thank you I will do a reinstall with the instructions you provided, thank you! I'll let you know how it goes. |
Hi again, It seems that FeGenie.py is not in the path any longer for me to summon without providing the absolute file location to the script. I'm trying a run on everything at once again (without the (fegenie) lloydlab@barbara:~/klds1515/SVA08.16_Metagenomes/071817KLmetagenome-45705703/Bins_from_Derep_Dastool/Fegenie$ /home/lloydlab/FeGenie/FeGenie.py -bin_dir /home/lloydlab/klds1515/SVA08.16_Metagenomes/071817KLmetagenome-45705703/Bins_from_Derep_Dastool/Fasta_files/Indiv_Anvio_fastas -bin_ext fasta -out /klds1515/SVA08.16_Metagenomes/071817KLmetagenome-45705703/Bins_from_Derep_Dastool/Fegenie/Output_redone -t 16 -bams Bam_file_for_fegenie.txt --makeplots --norm During handling of the above exception, another exception occurred: Traceback (most recent call last): **and then just a test run with test_run.sh: I promise I ran setup-conda-env.sh!! :) |
oh shoot, sorry for the delay in getting back to this. Let me see what might be going on. |
So there were a couple of issues with the setup-conda-env.sh file, which I have not fixed. But it seems that in order for this to work, you need to manually input FeGenie into your PATH in the .bash_profile file, which should be in your home directory. Alternatively, this file may be called .profile. Could you please delete the FeGenie repository that you have downloaded previously, and try again with the |
HI Arkadiy, |
Hmm, spoke too soon-- Error: iron_oxidation,Acidobacteria_stnAC.fasta,c_000000000041_24,Cyc2_repCluster3,88.4,27.3,558,1,MSRTIGSLSRCSMAGAALFSLAALAILCATPANAIPAFARKYETSCQTCHVAYPKLNTFGQAFRLLGYRMPGETEGQVKRPDVALGAASYKRVWPDAVWPGAIPQNLPLSLVANFQVQNSSQIEIEDGEVHRDTVNNDMIFPSEVALVVAGTAGEHVSYFGEIGFEQSVEAGMIEQEVGVEHIDIRFIRPIRNSMAFNVKIGSFQPELVSGFDHARRLTVANYDSMFGVSPIQSGGTEIVGGGGHHGGGGGISLPAVGRGIDLYGVVSHRFTWAAGILNGIGPGDATFDANSGKDTYVKGAYKWGGLAPDGSNAAVYAGSPKNWREKSVQVGVFYYRGDGKDIFFREEHDELGVIEVIFVEDPDYTRTGLDFNWYFKDLNIFGAYVSGDDDLRIYADSVTEPGEPGVFDPDESGTYTYKSWFVEADMVLGMPWLHGAVRYETVDLPRAEDGLKVQAYERATLSMTALVRANVKGVMEYTEDLNESRNYQFWLGAGIAF* processing... Acidimicrobiia_stnAC During handling of the above exception, another exception occurred: Traceback (most recent call last): |
Crud, sorry about that. Looks like I left Let me know how it goes. Thanks, |
Hi again, Creating depth matrix file: /klds1515/SVA08.16_Metagenomes/071817KLmetagenome-45705703/Bins_from_Derep_Dastool/Fegenie/Output_redone/Woeseia2_stnF.depth |
Shoot...but we're making progress down the script! Baby steps, ha. This error actually reminds me of the first one that you had, at the top of this issue thread. Any chance you're providing a Bam_file_for_fegenie.txt file with .fasta extensions missing from the first column? If not, then something else must be going on, and if you can share one of the .depth output files, along with the .csv outputs, that would be helpful. Thanks again, and sorry that FeGenie keeps crashing on you! |
No need to apologize! I appreciate you working with me on this. |
sounds good. Let me know how it goes. |
Hi Arkadiy, Latest error: From command: |
No worries, and thanks for sending the files. I can actually see what the cause of this error is from the command and error message. It is a small bug related to the last update, where I added the Thanks! |
So close!! Latest after removing previous conda environment, pulling down new repo, doing the conda setup, and running a successful test: |
Hey Joy, Sorry for the delay! I am looking into this now. Thanks for your patience. Arkadiy |
Hey! I fixed the bug that was causing this issue. This happens to be higher in the code then the previous errors, and I am not sure why that is, and why this bug was silent during your previous runs. In any case, if you can start from a fresh download of this script, there shouldn't be any other issues. But let me know if there are! Thanks again for your patience! Thank you! |
Hi Arkadiy, Creating depth matrix file: /klds1515/SVA08.16_Metagenomes/071817KLmetagenome-45705703/Bins_from_Derep_Dastool/Fegenie/Output_redone/Woeseia2_stnF.fasta.depth |
Thanks, Joy! Glad that you got the main output files. I just fixed the final little bug on line 2771, and added the R Package ggpubr to the list of packages installed with the setup-conda-env.sh script. So if you re-do the git clone and conda installation, you should avoid these errors. Thanks again for using FeGenie and for your patience as we deal through these bugs :) Cheers, |
Hi Arkadiy, Thanks! |
Thanks, Joy! That's so nice of you. But no token necessary. I am happy to help :) Perhaps if we're ever at a conference together, we can get together and talk some science over coffee or something. Glad to hear that the latest install worked. Looking forward to seeing what comes from this analysis! Don't hesitate to reach out if you have any other questions. Cheers, |
Hi again! It provided this error: Thanks for your help! |
Hi Joy! Sorry for the delayed response, especially since the fix for this was literally a single letter that was missing from that line in the script - but those are sometimes the hardest bugs to catch. I also fixed up a few other bugs to deal with the "Execution halted" errors from R, so the whole thing should be ready for the for-loop again. Let me know how it goes! Cheers, |
Hi Arkadiy, |
Hi Joy, I just noticed a mistake in the for-loop that I sent you. Instead of Thanks! |
Hi Arkadiy, |
Awesome, glad that it's working for you now! Have fun diving into the data. Looking forward to reading all about this study once it's published :) Let me know if you have other questions or issues. Cheers, |
Hi Arkadiy, I opened a similar issue months ago but never got back to you because I saw part of the problem was with my assembly and bam files, so that took me a bit to figure out (as well as finals, conferences, etc. haha.) and I couldn't find my original post so I hope its okay to add it here. Sorry for leaving you off the hook like that! but I am having a similar issue as above, I have attached my code and output but I am not sure if this is a bug or something on my end. any help would be heaven sent! Thank you!!! |
Hi there, Thanks again for your interest in FeGenie! I'd be happy to help out with this. Could you please send me the command that you're using and the file you are providing via the -bams flag? Thanks, |
Thank you! sorry for my delayed response, I had a conference this week and I am in the midst of moving into my first house. I am not using a bams file at the moment because it was just a test run with one file so I used the -bam command. but this is this command: FeGenie.py -bin_dir ~/bioinformatics/metagenomic_files/RQCF_output_3_unmerged/BFC_corrected/out.megahit.A.samples/ -bin_ext fa -out bam_fegenie_out --makeplots -t 6 -bam ~/bioinformatics/metatranscriptomics/raw_fastq_files/unmerged_rqcfiles/WD-1_S1.anqrpt.fastq.pairedMapped_sorted.bam --meta I tried to attach the bam file too but it wouldn't let me load it here. I can email it to you if needed! Thank you! |
Hi, Thanks for that information, and congrats on the new house! Based on this information, I would guess that the contig names in the FASTA files inside the "megahit.A.samples" directory differ from the contig names that were used to generate the BAM file. Can you please double-check the FASTA file used to generate the WD-1_S1.anqrpt.fastq.pairedMapped_sorted.bam file, and compare them to the header names in the megahit.A.samples FASTA files. Also, if there is the pipe character '|' in header names, that may also cause problems, since FeGenie uses that character as a delimiter in some parts of the code. Let me know if you find any differences! Thanks, |
Hi there,
I am aiming to make plots with coverage data for several bam files. I get .depth files and .csv files (as well as ORF_calls and HMM_results. However, there are no plots. Here is my input and the error:
FeGenie.py -bin_dir /home/lloydlab/klds1515/SVA08.16_Metagenomes/071817KLmetagenome-45705703/Bins_from_Derep_Dastool/Fasta_files/Indiv_Anvio_fastas -bin_ext fasta -out /klds1515/SVA08.16_Metagenomes/071817KLmetagenome-45705703/Bins_from_Derep_Dastool/Fegenie/Output/Plots -t 16 -bams Bam_file_for_fegenie.txt --makeplots --norm
Thread 0 finished: VKAB123_transcripts_MAGs_no_ribo.bam with 15432 reads and 8719 readsWellMapped
Creating depth matrix file: /klds1515/SVA08.16_Metagenomes/071817KLmetagenome-45705703/Bins_from_Derep_Dastool/Fegenie/Output/Plots/Woeseia_stnF.depth
Closing most bam files
Closing last bam file
Finished
processing... Woeseia_stnF
Traceback (most recent call last):
File "/home/lloydlab/anaconda3/envs/fegenie/bin/FeGenie.py", line 3020, in
main()
File "/home/lloydlab/anaconda3/envs/fegenie/bin/FeGenie.py", line 2586, in main
Dict[cell][process].append(float(depthDict[cell][contig]))
ValueError: could not convert string to float: 'EMPTY'
Thank you!
The text was updated successfully, but these errors were encountered: