Skygrid-Protocol

Input and output files accompanying the Skygrid protocol paper 'Bayesian estimation of past population dynamics in BEAST 1.10 using the Skygrid coalescent model'

The folder 0.alignment contains the files for preparing the initial alignment of 200 Ebolavirus sequences from Sierra Leone:

200_unaligned.fasta: the 200 sequences as downloaded from GenBank
200_alignment_unclean.fasta: result of aligning the sequences using MAFFT
200_final.fasta: final alignment after cleaning, with the various cleaning steps discussed in Supplementary Materials
KR105247.gb: example GenBank entry of the Ebolavirus sequence with accession number KR105247

The folder 1.tempest contains the following input files for generating a maximum-likelihood tree using IQ-TREE:

200_final.fasta: the multiple sequence alignment generated in the previous step
partitions.nex: NEXUS file (without any genetic data) delimiting the various partitions within the alignment

All other files within the folder are output files generated by IQ-TREE, with the .treefile containing the maximum-likelihood (ML) phylogeny

The folder 2.treetime contains three subfolders with files corresponding to three different steps when generating a time-stamped ML phylogeny:

1.iqtree-196taxa: input files (196_final.fasta and partitions.nex) to generate the ML phylogeny of 196 Ebolavirus sequences using IQ-TREE, as well as the output files
2.treetime-inputfiles: input files to run TreeTime on the 196-taxa data set
3.treetime-outputfiles: output files from TreeTime

The folder 3.beauti contains two FASTA files with genetic data, after removing 4 sequences in the TempEst analysis:

196_CDS.fasta: multiple sequence alignment for the coding region
196_IG.fasta: multiple sequence alignment for the aggregated intergenic regions

These 2 files need to be imported into BEAUti in order to setup the various model components for the BEAST analysis.

The folder 4.beast contains the BEAST XML file that was generated using BEAUti.

Finally, the folder 5.tracer contains the output files of 2 independent BEAST analyses of the generated XML file:

BEAST_100m_run.1.log: first replicate of the BEAST analysis, with starting seed 123
BEAST_100m_run.2.log: second replicate of the BEAST analysis, with starting seed 456

These files need to be loaded into Tracer in order to assess convergence and proper mixing of the analyses, as well as determining burn-in. Note that we here do not provide the .trees files generated during the BEAST analysis, as these are not needed to reconstruct the past population dynamics using the Skygrid coalescent model.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Skygrid-Protocol

About

Releases

Packages

Contributors 2

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
0.alignment		0.alignment
1.tempest		1.tempest
2.treetime		2.treetime
3.beauti		3.beauti
4.beast		4.beast
5.tracer		5.tracer
README.md		README.md

GuyBaele/Skygrid-Protocol

Folders and files

Latest commit

History

Repository files navigation

Skygrid-Protocol

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Packages