Skip to content

A repository for the analysis involved in "Evidence for widespread class II microcins in Enterobacterales genomes"

License

Notifications You must be signed in to change notification settings

wilkelab/cinful_data_analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

41 Commits
 
 
 
 
 
 

Repository files navigation

microcins_enterobacterales

A repository for the analysis involved in "Evidence for widespread class II microcins in Enterobacterales genomes"

Getting started

Below are instructions for reproducing the analysis to identify microcins in Enterobacterales genomes. The two most important components are to have cinful installed and to download the genome assemblies.

cinful

Documentation on cinful can be found at https://github.com/wilkelab/cinful

Data download

This repository contains the accession numbers and descriptions of all genome assemblies used in this analysis. There is more than one way to retrieve these files. The easiest is with ncbi-genome-download.

The data are split into three groups. The first is the E. coli pan genome from Touchon et al. (2017). Second are genome assemblies from Enterobacteriaciae from species in GTDB that have at least 20 assemblies per genus. Finally, there is a representation of one assembly per species from other Enterobacterales.

Final output

This repository includes final output from the cinful pipeline. These final output files were analyzed to generate the figures for the paper. The analysis for each figure is stored in full across separate jupyter notebooks.

About

A repository for the analysis involved in "Evidence for widespread class II microcins in Enterobacterales genomes"

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published