Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

(At least) GTDB fails to download #171

Closed
erikrikarddaniel opened this issue Aug 25, 2023 · 6 comments
Closed

(At least) GTDB fails to download #171

erikrikarddaniel opened this issue Aug 25, 2023 · 6 comments
Assignees
Labels
bug Something isn't working

Comments

@erikrikarddaniel
Copy link
Member

Description of the bug

I ran the test case with eukulele/gtdb but the download failed:

ERROR ~ Error executing process > 'NFCORE_METATDENOVO:METATDENOVO:SUB_EUKULELE:EUKULELE_DOWNLOAD ($meta.id)'

Caused by:
  Process `NFCORE_METATDENOVO:METATDENOVO:SUB_EUKULELE:EUKULELE_DOWNLOAD ($meta.id)` terminated with an error exit status (1)

Command executed:

  pushd eukulele
  
  EUKulele \
      download \
       \
      --database gtdb
  
  popd
  touch gtdb
  
  cat <<-END_VERSIONS > versions.yml
  "NFCORE_METATDENOVO:METATDENOVO:SUB_EUKULELE:EUKULELE_DOWNLOAD":
      eukulele: $(echo $(EUKulele --version 2>&1) | sed 's/Running EUKulele with command line arguments, as no valid configuration file was provided.//; s/The current EUKulele version is//g')
  END_VERSIONS

Command exit status:
  1

Command output:
  ~/dev/nf-core-metatdenovo/work/17/8445ecb2cd7a6068a6d58fe253433f/eukulele ~/dev/nf-core-metatdenovo/work/17/8445ecb2cd7a6068a6d58fe253433f
  Running EUKulele with command line arguments, as no valid configuration file was provided.
  Specified reference directory, reference FASTA, and protein map/taxonomy table not found. Using database in location: ./gtdb.
  Automatically downloading database gtdb . If you intended to use an existing database folder, be sure a reference FASTA, protein map, and taxonomy table are provided. Check the documentation for details.
  Specified reference database, gtdb is not supported.

Command error:
  Unable to find image 'quay.io/biocontainers/eukulele:2.0.3--pyh723bec7_0' locally
  2.0.3--pyh723bec7_0: Pulling from biocontainers/eukulele
  73349e34840e: Already exists
  acab339ca1e8: Already exists
  935c26139724: Pulling fs layer
  935c26139724: Verifying Checksum
  935c26139724: Download complete
  935c26139724: Pull complete
  Digest: sha256:7933d7d1a2a15a96aef32fd3c49edecb5a5850de0c344ff3920a06a135267578
  Status: Downloaded newer image for quay.io/biocontainers/eukulele:2.0.3--pyh723bec7_0
  ~/dev/nf-core-metatdenovo/work/17/8445ecb2cd7a6068a6d58fe253433f/eukulele ~/dev/nf-core-metatdenovo/work/17/8445ecb2cd7a6068a6d58fe253433f
  Running EUKulele with command line arguments, as no valid configuration file was provided.
  Specified reference directory, reference FASTA, and protein map/taxonomy table not found. Using database in location: ./gtdb.
  Automatically downloading database gtdb . If you intended to use an existing database folder, be sure a reference FASTA, protein map, and taxonomy table are provided. Check the documentation for details.
  Specified reference database, gtdb is not supported.

Work dir:
  /home/daluab/dev/nf-core-metatdenovo/work/17/8445ecb2cd7a6068a6d58fe253433f

Tip: when you have fixed the problem you can continue the execution adding the option `-resume` to the run command line

 -- Check '.nextflow.log' file for details

A gtdb subdirectory was created in eukulele but it was empty.

Invoking EUKulele download --database gtdb from the terminal seems to work fine.

Command used and terminal output

# Without having a downloaded gtdb database:
nextflow run . -profile test,docker -resume --assembler rnaspades --eukulele_db gtdb --skip_eukulele false

Relevant files

No response

System information

No response

@erikrikarddaniel erikrikarddaniel added the bug Something isn't working label Aug 25, 2023
@erikrikarddaniel erikrikarddaniel added this to the nf-core candidate milestone Aug 25, 2023
@danilodileo
Copy link
Collaborator

Could it be EUKulele old version? I can take a look

@danilodileo
Copy link
Collaborator

By changing the container, the error persists. The module is able to download the reference.pep.fa and taxonomy-table.txt but fails to download the protein table prot-map,json . I would try to download all the files outside the module with the same version to be sure nothing is wrong in the EUKulele version

@erikrikarddaniel
Copy link
Member Author

I did just that and it worked fine: Download with Eukulele outside the workflow, then run the workflow with downloaded files.

@danilodileo
Copy link
Collaborator

I am trying to download another database (PhyloDB) through metatdenovo. if it not working then the issue is in our code

@danilodileo
Copy link
Collaborator

This issue can be closed as we don't have any module EUKULELE:DOWNLOAD. but the issue on downloading GTDB persist and it's reported in another issue #256

@akrinos
Copy link

akrinos commented Jul 31, 2024

Just putting a comment here that I will be addressing this issue within EUKulele in a release in the next 2 weeks. In the meantime, there is a workaround for using the standalone EUKulele tool and attempting this download AlexanderLabWHOI/EUKulele#58

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

3 participants