A framework for distributed graph computation and machine learning at wechat scale, for more details, see 柏拉图简介 | Plato Introduction.
Authors(In alphabetical order): Benli Li, Conghui He, Donghai Yu, Pin Gao, Shijie Sun, Wenqiang Wu, Wanjing Wei, Xing Huang, Xiaogang Tu, Yongan Li.
Contact: [email protected]
Special thanks to Xiaowei Zhu and many for their work Gemini which served as an inspiration and guide in building plato‘s dualmode-engine. Thanks to Ke Yang and many for their work KnightKing which served as foundation of plato's walk-engine.
To simplify installation, Plato currently downloads and builds most of its required dependencies by calling 3rdtools.sh
. You should call it at least once before any build operations.
There are however, a few dependencies which must be manually satisfied.
- GCC
- At least 4.8.5 for C++11 support.
- MPICH-3
- Required for compiling and run Plato.
- OpenMP
- Required for compiling and run Plato.
- Bazel-0.26
- Required for compiling.
Plato was developed and tested on x86_64 cluster and Centos 7.0. Theoretically, it can be ported to other Linux distribution easily.
BAZEL_LINKOPTS=-static-libstdc++ CC=/your_mpi_location/mpicxx bazel build example/...
BAZEL_LINKOPTS=-static-libstdc++ CC=/your_mpi_location/mpicxx LD_LIBRARY_PATH=${LD_LIBRARY_PATH}:${PWD}/3rd/hadoop2/lib bazel test --test_env=LD_LIBRARY_PATH plato/...
Prerequisite:
- A cluster which can submit MPI programs(Hydra is a feasible solution).
- An accessible HDFS where Plato can find its input and put output on it.
A sample submit script was locate in here, modify it based on your cluster's environment and run.
./scripts/run_pagerank.sh