Skip to content

Commit

Permalink
Add busco list datasets component (#18)
Browse files Browse the repository at this point in the history
* add script and config to list busco datasets

* update test

* update busco to point to busco_list_datasets component

* add changelog entry

* update changelog

* Update CHANGELOG.md

Co-authored-by: Robrecht Cannoodt <[email protected]>

* rename busco to busco run

---------

Co-authored-by: Robrecht Cannoodt <[email protected]>
  • Loading branch information
dorien-er and rcannood authored Feb 11, 2024
1 parent 487c5d2 commit 4c50ac9
Show file tree
Hide file tree
Showing 11 changed files with 62 additions and 3 deletions.
4 changes: 3 additions & 1 deletion CHANGELOG.md
Original file line number Diff line number Diff line change
Expand Up @@ -8,7 +8,9 @@

* `fastp`: An ultra-fast all-in-one FASTQ preprocessor (PR #3).

* `busco`: Assess genome assembly and annotation completeness with single copy orthologs (PR #6).
* `busco`:
- `busco/busco_run`: Assess genome assembly and annotation completeness with single copy orthologs (PR #6).
- `busco/busco_list_datasets`: Lists available busco datasets (PR #18)

* `featurecounts`: Assign sequence reads to genomic features (PR #11).

Expand Down
36 changes: 36 additions & 0 deletions src/busco/busco_list_datasets/config.vsh.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,36 @@
functionality:
name: busco
description: Lists the available busco datasets
info:
keywords: [lineage datasets]
homepage: https://busco.ezlab.org/
documentation: https://busco.ezlab.org/busco_userguide.html
repository: https://gitlab.com/ezlab/busco
reference: "10.1007/978-1-4939-9173-0_14"
licence: MIT
argument_groups:
- name: Outputs
arguments:
- name: --output
alternatives: ["-o"]
direction: output
type: file
description: |
Output file of the available busco datasets
required: false
default: busco_dataset_list.txt
example: file.txt
resources:
- type: bash_script
path: script.sh
test_resources:
- type: bash_script
path: test.sh
platforms:
- type: docker
image: quay.io/biocontainers/busco:5.6.1--pyhdfd78af_0
setup:
- type: docker
run: |
busco --version | sed 's/BUSCO\s\(.*\)/busco: "\1"/' > /var/software_versions.txt
- type: nextflow
6 changes: 6 additions & 0 deletions src/busco/busco_list_datasets/script.sh
Original file line number Diff line number Diff line change
@@ -0,0 +1,6 @@
#!/bin/bash

## VIASH START
## VIASH END

busco --list-datasets | awk '/^#{40}/{flag=1; next} flag{print}' > $par_output
15 changes: 15 additions & 0 deletions src/busco/busco_list_datasets/test.sh
Original file line number Diff line number Diff line change
@@ -0,0 +1,15 @@
#!/bin/bash

## VIASH START
## VIASH END

"$meta_executable" \
--output datasets.txt

echo ">> Checking output"
[ ! -f "datasets.txt" ] && echo "datasets.txt does not exist" && exit 1

echo ">> Checking if output is empty"
[ ! -s "datasets.txt" ] && echo "datasets.txt is empty" && exit 1

rm datasets.txt
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
functionality:
name: busco
name: busco_run
description: Assessment of genome assembly and annotation completeness with single copy orthologs
info:
keywords: [Genome assembly, quality control]
Expand Down Expand Up @@ -35,7 +35,7 @@ functionality:
required: false
description: |
Specify a BUSCO lineage dataset that is most closely related to the assembly or gene set being assessed.
The full list of available datasets can be viewed [here](https://busco-data.ezlab.org/v5/data/lineages/) or by running `busco --list-datasets` (which requires installing the tool).
The full list of available datasets can be viewed [here](https://busco-data.ezlab.org/v5/data/lineages/) or by running the busco/busco_list_datasets component.
When unsure, the "--auto_lineage" flag can be set to automatically find the optimal lineage path.
Requested datasets will automatically be downloaded if not already present in the download folder.
example: stramenopiles_odb10
Expand Down
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.

0 comments on commit 4c50ac9

Please sign in to comment.