This repository contains cached data and processing steps for day 2 of the symposium. The work is split into two major pipelines: [1] obtaining data from the BioML data set and processing it into assemblies, and [2] generating CarveMe reconstructions for all assemblies with reliable GTDB assignments.
Required compute: about 1,000 CPU hours.
The assembly steps are wrapped into a Nextflow pipeline provided with this repository: assembly.nf. A conda environment file is included to set up all required dependencies. The pipeline covers the following steps:
- Downloading the first 1000 isolate genomes from the BioML paper.
- Quality filtering and trimming with fastp.
- Assembly with MEGAHIT.
- Taxonomic placement with the GTDB toolkit (GTDB-Tk).
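A typical invocation of the pipeline might look like the sketch below. The environment file name, environment name, and Nextflow options are illustrative assumptions, not taken from the repository:

```shell
# Set up dependencies from the provided conda environment file
# (file and environment names are assumptions).
conda env create -f environment.yml
conda activate bioml-assembly

# Run the assembly pipeline; -resume picks up from cached work
# if an earlier run was interrupted.
nextflow run assembly.nf -resume
```

The `-resume` flag is useful here because the full run takes on the order of 1,000 CPU hours, so restarting from Nextflow's work cache avoids repeating completed tasks.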
After that, the data are curated by hand to remove isolates without a clear bacterial GTDB assignment. This step is contained in an RStudio notebook and leaves a little fewer than 980 assemblies.
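The curation criterion can be sketched on the GTDB-Tk summary table. The snippet below is a hypothetical filter, assuming the usual `gtdbtk.bac120.summary.tsv` layout (genome name in column 1, semicolon-separated classification in column 2); it keeps isolates classified as Bacteria with a non-empty genus. The actual notebook may use different rules:

```shell
# Minimal example input mimicking a GTDB-Tk summary table
# (contents are made up for illustration).
cat > summary_example.tsv <<'EOF'
user_genome	classification
iso_0001	d__Bacteria;p__Bacteroidota;c__Bacteroidia;o__Bacteroidales;f__Bacteroidaceae;g__Bacteroides;s__Bacteroides uniformis
iso_0002	Unclassified
iso_0003	d__Bacteria;p__Firmicutes;c__Clostridia;o__Oscillospirales;f__;g__;s__
EOF

# Keep genomes in domain Bacteria whose genus field is non-empty.
awk -F '\t' 'NR > 1 && $2 ~ /^d__Bacteria/ && $2 ~ /g__[^;]+;/ {print $1}' summary_example.tsv
```

Here only `iso_0001` survives: `iso_0002` has no domain assignment and `iso_0003` lacks a genus.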
The CarveMe reconstructions are generated using the Gibbons Lab model builder pipeline. The required growth media are provided in the repository as well.
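For orientation, the underlying CarveMe call per assembly might resemble the loop below. This is a hedged sketch: the directory layout, file names, and media ID (`M9`) are placeholders, and the model builder pipeline may wrap CarveMe differently:

```shell
# Build one metabolic model per protein FASTA file
# (paths and media ID are illustrative assumptions).
mkdir -p models
for faa in proteins/*.faa; do
    carve "$faa" \
        --output "models/$(basename "$faa" .faa).xml" \
        --gapfill M9 \
        --mediadb media.tsv
done
```

The `--gapfill` and `--mediadb` options tie the reconstructions to the media files shipped with the repository, so each model is gap-filled to grow on the specified medium.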