- Snakemake
- Roary
- PGAP
- Panaroo
- ncbi-blast+
- R
1- Git clone
git clone https://github.com/SouthGreenPlatform/PanExplorer_workflow.git
2- Define the PANEX_PATH environnement variable
cp -rf PanExplorer_workflow /usr/local/bin
export PANEX_PATH=/usr/local/bin/PanExplorer_workflow
3- Get preformatted RPS-BLAST+ database of the CDD COG distribution
wget https://ftp.ncbi.nlm.nih.gov/pub/mmdb/cdd/little_endian/Cog_LE.tar.gz
tar -xzvf Cog_LE.tar.gz
cp -rf Cog.* $PANEX_PATH/COG
Edit a new file names "genbank_ids" listing the Genbank identifiers of complete assembled and annotated genomes. The file should look like this
cat genbank_ids
CP000235.1
CP001079.1
CP001759.1
CP015994.2
Creating a pangenome using Roary
snakemake --cores 1 -s $PANEX_PATH/Snakemake_files/Snakefile_wget_roary_heatmap_upset_COG
Creating a pangenome using PanACoTA
snakemake --cores 1 -s $PANEX_PATH/Snakemake_files/Snakefile_wget_panacota_heatmap_upset_COG
Creating a pangenome using PGAP
snakemake --cores 1 -s $PANEX_PATH/Snakemake_files/Snakefile_wget_PGAP_heatmap_upset_COG