Skip to content

CellPhy: accurate and fast probabilistic inference of single-cell phylogenies

License

Notifications You must be signed in to change notification settings

amkozlov/cellphy

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

37 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

CellPhy: accurate and fast probabilistic inference of single-cell phylogenies

Installation

  1. Supported operating systems: currently, Linux and macOS only.

  2. Clone GitHub repository:

git clone https://github.com/amkozlov/cellphy
  1. Install dependencies

CellPhy is using bundled RAxML-NG to perform tree inference, hence this core functionality is available without installing any dependencies. However, some additional features (tree visualization, format conversion etc.) rely on external software. So in order to have a fully functional pipeline, it is highly recommended to install following packages:

Your installation options depend on the system:

a) For Ubuntu and RedHat/CentOS Linux with root access, we provide a script for automatic dependency installation:

cd cellphy
sudo ./install.sh  

b) Under macOS, you can use homebrew:

brew install r
brew install bcftools

c) In a cluster environment, chances are the required packages are already installed and can be loaded with the module command, e.g.:

module load bcftools R
  1. Check that everything works:
./cellphy.sh

Usage

Standard analysis with default parameters:

./cellphy.sh input.vcf

General syntax:

./cellphy.sh [COMMAND] [options] input.vcf

COMMAND:

  • FULL Full analysis: Thorough tree search + bootstrapping + mutation mapping (default)

  • SEARCH Thorough tree search (20 starting trees) + mutation mapping

  • FAST Fast tree search (single starting tree) + mutation mapping

Options:

  • -a Use approximate 10-state model (~2x faster)

  • -g FILE Tab-delimited list of SNVs for mapping, with respective gene names (example)

  • -m MODEL Evolutionary model definition in RAxML-NG format
    NOTE: partitioned models are not supported at the moment!

  • -o OUTGR Outgroup taxon list (comma-separated), e.g. -o healthy or -o H1,H2

  • -p PREFIX Prefix for output files (default: input filename w/o extension)

  • -r REDO mode: overwrite all result files

  • -t THREADS Number of threads to use (default: autodetect)

  • -y Skip mutation mapping

  • -z Skip tree plotting (no PDF/SVG output)

Tutorial

For some advanced usage examples, please see tutorial.

Citations

When using CellPhy, please cite this paper:

Alexey Kozlov, João M Alves, Alexandros Stamatakis, and David Posada (2022) CellPhy: accurate and fast probabilistic inference of single-cell phylogenies from scDNA-seq data Genome Biol 23, 37 doi: 10.1186/s13059-021-02583-w

When using tree visualization features (e.g., mutation map), please additionally cite ggtree:

G Yu, DK Smith, H Zhu, Y Guan, TTY Lam (2017) ggtree: an R package for visualization and annotation of phylogenetic trees with their covariates and other associated data. Methods in Ecology and Evolution. 2017, 8(1):28-36. doi: 10.1111/2041-210X.12628

About

CellPhy: accurate and fast probabilistic inference of single-cell phylogenies

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Contributors 3

  •  
  •  
  •