-
Supported operating systems: currently, Linux and macOS only.
-
Clone GitHub repository:
git clone https://github.com/amkozlov/cellphy
- Install dependencies
CellPhy is using bundled RAxML-NG to perform tree inference, hence this core functionality is available without installing any dependencies. However, some additional features (tree visualization, format conversion etc.) rely on external software. So in order to have a fully functional pipeline, it is highly recommended to install following packages:
- bcftools
- R environment
- R packages: ggtree and its dependencies (treeio, ape...)
Your installation options depend on the system:
a) For Ubuntu and RedHat/CentOS Linux with root access, we provide a script for automatic dependency installation:
cd cellphy
sudo ./install.sh
b) Under macOS, you can use homebrew:
brew install r
brew install bcftools
c) In a cluster environment, chances are the required packages are already installed and can be loaded with the module
command, e.g.:
module load bcftools R
- Check that everything works:
./cellphy.sh
Standard analysis with default parameters:
./cellphy.sh input.vcf
General syntax:
./cellphy.sh [COMMAND] [options] input.vcf
COMMAND:
-
FULL
Full analysis: Thorough tree search + bootstrapping + mutation mapping (default) -
SEARCH
Thorough tree search (20 starting trees) + mutation mapping -
FAST
Fast tree search (single starting tree) + mutation mapping
Options:
-
-a
Use approximate 10-state model (~2x faster) -
-g FILE
Tab-delimited list of SNVs for mapping, with respective gene names (example) -
-m MODEL
Evolutionary model definition in RAxML-NG format
NOTE: partitioned models are not supported at the moment! -
-o OUTGR
Outgroup taxon list (comma-separated), e.g.-o healthy
or-o H1,H2
-
-p PREFIX
Prefix for output files (default: input filename w/o extension) -
-r
REDO mode: overwrite all result files -
-t THREADS
Number of threads to use (default: autodetect) -
-y
Skip mutation mapping -
-z
Skip tree plotting (no PDF/SVG output)
For some advanced usage examples, please see tutorial.
When using CellPhy, please cite this paper:
Alexey Kozlov, João M Alves, Alexandros Stamatakis, and David Posada (2022) CellPhy: accurate and fast probabilistic inference of single-cell phylogenies from scDNA-seq data Genome Biol 23, 37 doi: 10.1186/s13059-021-02583-w
When using tree visualization features (e.g., mutation map), please additionally cite ggtree:
G Yu, DK Smith, H Zhu, Y Guan, TTY Lam (2017) ggtree: an R package for visualization and annotation of phylogenetic trees with their covariates and other associated data. Methods in Ecology and Evolution. 2017, 8(1):28-36. doi: 10.1111/2041-210X.12628