Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: add workflow and assemblies yml files first pass (#206) #207

Draft
wants to merge 4 commits into
base: main
Choose a base branch
from
Draft
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
95 changes: 95 additions & 0 deletions files/source/assemblies.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,95 @@
assemblies:
# plasmodium falciparum
- accession: GCF_000002765.6
ploidy: haploid
source: plasmodb.org
# plasmodium vivax
- accession: GCF_000002415.2
ploidy: haploid
ncbi_dataset: true
- accession: GCA_900093555.2
ploidy: haploid
source: plasmodb.org
# plasmodium yoelii
- accession: GCF_900002385.2
ploidy: haploid
source: plasmodb.org
# plasmodium vinckei
- accession: GCF_900681995.1
ploidy: haploid
source: plasmodb.org
# culex p. pallens
- accession: GCF_016801865.2
ploidy: diploid
source: vectorbase.org
# culex p. quinquefasciatus
- accession: GCF_015732765.1
ploidy: diploid
source: vectorbase.org
# anopheles gambiae
- accession: GCF_943734735.2
ploidy: diploid
ncbi_dataset: true
- accession: GCF_000005575.2
ploidy: diploid
source: vectorbase.org
# toxoplasma gondii
- accession: GCF_000006565.2
ploidy: haploid
source: toxodb.org
# mycobacterium tuberculosis
- accession: GCF_000195955.2
ploidy: haploid
# coccidioides posadasii
- accession: GCF_018416015.2
ploidy: haploid
ncbi_dataset: true
- accession: GCA_000170175.2
ploidy: haploid
source: fungidb.org
# coccidioides immitis
- accession: GCF_000149335.2
ploidy: haploid
source: fungidb.org
# trypanosoma cruzi
- accession: GCF_000209065.1
ploidy: diploid
source: tritrypdb.org
# trypanosoma brucei
- accession: GCF_000002445.2
ploidy: diploid
source: tritrypdb.org
# leishmania major
- accession: GCF_000002725.2
ploidy: diploid
source: tritrypdb.org
# leishmania donovani
- accession: GCF_000227135.1
ploidy: diploid
source: tritrypdb.org
# leishmania braziliensis
# danielle's favorite
- accession: GCF_000002845.2
ploidy: diploid
source: tritrypdb.org
# covid
- accession: GCF_009858895.2
ploidy: haploid
# mpox
- accession: GCF_000857045.1
ploidy: haploid
# aspergillus fumigatus
- accession: GCF_000002655.1
ploidy: haploid
source: fungidb.org
# candida albicans
- accession: GCF_000182965.3
ploidy: diploid
source: fungidb.org
# crypococcus neoformans
- accession: GCF_000091045.1
ploidy: haploid
ncbi_dataset: true
- accession: GCA_000149245.3
ploidy: haploid
source: fungidb.org
46 changes: 46 additions & 0 deletions files/source/galaxy_workflows.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,46 @@
workflows:
# this list of workflows and their annotations are a wip and will be updated as i learn more
# about their status and the tools used, etc.
- trs_id: "#workflow/github.com/iwc-workflows/chipseq-pe/main"
type: "REGULATION"
description: "Workflows for the analysis of ChIP-seq, ATAC-Seq, and beyond."
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

are these descriptions for the workflows? or for the 'type' ?

there will eventually be many workflows belonging to a 'type' (or whatever we end up calling it.. type is too vague imo), and i wonder if this source of duplication implies a need for a new yaml file?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, reading the descriptions, I agree they should like they describe the workflow type rather than a specific workflow within the type. I think this came from having only one workflow per type for the initial kickoff. Sounds like we do need a type (or "modality?) tab.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think we could also add (or generate) links to view the workflow details in GitHub, Dockstore etc. or wherever they live.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can/should we align these 'types' with the collection as organized on dockstore (and iwc site)? (https://dockstore.org/organizations/iwc)

This can be programmatically fetched if so.

ploidy: "any"

- trs_id: "#workflow/github.com/iwc-workflows/chipseq-sr/main"
type: "REGULATION"
description: "Workflows for the analysis of ChIP-seq, ATAC-Seq, and beyond."
ploidy: "any"

- trs_id: "#workflow/github.com/iwc-workflows/rnaseq-pe/main"
type: "TRANSCRIPTOMICS"
description: "Analyze bulk or single-cell RNA seq data using a variety of approaches."
ploidy: "any"

- trs_id: "#workflow/github.com/iwc-workflows/rnaseq-sr/main"
type: "TRANSCRIPTOMICS"
description: "Analyze bulk or single-cell RNA seq data using a variety of approaches."
ploidy: "any"

- trs_id: "#workflow/github.com/iwc-workflows/haploid-variant-calling-wgs-pe/main"
type: "VARIANT_CALLING"
description: "Identify nucleotide polymorphisms and short indels from Illumina and Element data."
ploidy: "haploid"

# im starting to wonder looking at this if the default for ploidy shouldnt be any
# maybe it could be multivalued [haploid, diploid].. might not matter depending on ui
- trs_id: "#workflow/github.com/iwc-workflows/generic-variant-calling-wgs-pe/main"
type: "VARIANT_CALLING"
description: "Identify nucleotide polymorphisms and short indels from Illumina and Element data."
ploidy: "any"
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think this is meant for haploid / non-diploid organisms. @nekrut I think you wrote this one ?


- trs_id: "#workflow/github.com/iwc-workflows/assembly-with-flye/main"
type: "ASSEMBLY"
description: "Best practices for assembly of prokaryotic and eukaryotic genomes sequenced with a variety of technologies."
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The description is repeated, but also weird ? Maybe stick to the Workflow for ... pattern you've used above.

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yea I think these could use work. For now I was just replicating what was on the site as category descriptions.

ploidy: "any"

# idk yet what makes this specific to bacteria, or if it can be easily generalized
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks. maybe it's enough to annotate as haploid then..

# but we may need a new annotation once i figure it out
- trs_id: "#workflow/github.com/iwc-workflows/bacterial-genome-assembly/main"
type: "ASSEMBLY"
description: "Best practices for assembly of prokaryotic and eukaryotic genomes sequenced with a variety of technologies."
ploidy: "any"
28 changes: 28 additions & 0 deletions files/source/outbreaks.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,28 @@
outbreaks:
- outbreak:
tax_id: 2697049
resources:
- resource:
title: "Coronavirus: Update Related to the Current Outbreak of COVID-19"
url: "https://pmc.ncbi.nlm.nih.gov/articles/PMC7139198/"
- resource:
title: "COVID-19 Pandemic: A Review of the Global Response"
url: "https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7162758/"
- resource:
title: "The Impact of COVID-19 on Public Health"
url: "https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7275854/"
- resource:
title: "COVID-19: Epidemiology, Virology, and Clinical Features"
url: "https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7159299/"
- outbreak:
tax_id: 10244
resources:
- resource:
title: "Mpox Virus: Understanding the Outbreak"
url: "https://example.com/mpox-outbreak"
- resource:
title: "Mpox Virus: Symptoms and Treatment"
url: "https://example.com/mpox-symptoms-treatment"
- resource:
title: "Global Response to the Mpox Outbreak"
url: "https://example.com/global-response-mpox"
Loading