Skip to content

Pan genome analysis of closely related prokariotic genomes

License

Notifications You must be signed in to change notification settings

metashot/prok-pan

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

metashot/prok-pan

metashot/prok-pan is a workflow for pan genome analysis of closely related prokariotic genomes, mitochondria, and viruses.

Main features

  • Input: prokaryotic genomes in FASTA format;
  • Rapid prokaryotic genome annotation using prokka;
  • Pan genome analysis and visualization using Roary;
  • Phylogenetic tree inference (core genome) using RAxML, optional.

Quick start

  1. Install Docker (or Singulariry) and Nextflow (see Dependences);
  2. Start running the analysis:
nextflow run metashot/prok-pan \
  --genomes "data/*.fa" \
  --outdir results

Parameters

See the file nextflow.config for the complete list of parameters.

Output

The files and directories listed below will be created in the results directory after the pipeline has finished.

Main outputs

  • roary: Roary output files. This folder includes summary_statistics.txt (number of genes in the core and accessory), gene_presence_absence.csv and the pangenome plots (pangenome_*.png).

Secondary outputs

  • prokka: the prokka output for each input sample;
  • raxml: RAxML output (when --skip_core_tree = false).

Documentation

Phylogenetic tree

If --skip_core_tree = false the phylogenetic tree is inferred from the core genome alignment using te default RaxML tree search algorithm1 The following RAxML parameters will be used:

-f d -m GTRCAT -N [RAXML_NSEARCH]

System requirements

Please refer to System requirements for the complete list of system requirements options.


1: Stamatakis A., Blagojevic F., Nikolopoulos D.S. et al. Exploring New Search Algorithms and Hardware for Phylogenetics: RAxML Meets the IBM Cell. J VLSI Sign Process Syst Sign Im 48, 271–286 (2007). Link.

About

Pan genome analysis of closely related prokariotic genomes

Resources

License

Stars

Watchers

Forks

Packages

No packages published