Copyright 2018, 2019 The Ohio State University
Simon Zhang, Mengbai Xiao, Chengxin Guo, Liang Geng, Hao Wang, Xiaodong Zhang
HYPHA makes calls to the PHAT library [3]
, developed by:
Ulrich Bauer, Michael Kerber, Jan Reininghaus, Hubert Wagner
HYPHA is a hybrid computing engine executed by both GPU and multicore to achieve high performance for computing persistent homology matrix reduction [6]
.
The essential foundation of our algorithm design and implementation is the separation of SIMT and MIMD parallelisms in Persistent Homology (PH) matrix reduction computation. With such a separation, we are able to perform massive parallel scanning operations on GPU in a super-fast manner, which also collects rich information from an input boundary matrix for further parallel reduction operations on multicore with high efficiency.
For detailed information about the framework and algorithms design of HYPHA, please refer to the paper in ACM ICS 2019 [8]
.
Dependencies:
-
Make sure cmake is installed at the correct version for the cmake file (e.g. cmake 2.8.12.2)
-
Make sure CUDA is installed at the correct version for the cmake file (e.g. CUDA 10.0.130)
-
Make sure GCC is installed at the correct version for the cmake file (e.g. GCC 7.3.0)
HYPHA is intended to run on high performance computing systems
Thus, a GPU with enough device memory is needed to run large datasets. (e.g. Tesla V100 GPU with 16GB device DRAM). If the system's GPU is not compatible, or the system does not have a GPU, error messages will appear.
Furthermore, it is also preferable to have a multicore processor (e.g. >= 28 cores) for effective computation, and a large amount of DRAM is required for large datasets. We have tested on a 192 GB DRAM single computing node with 40 cores.
If Unix-based operating system is used, the following actions are needed for installation:
type the following sequence of commands:
git clone https://github.com/simonzhang00/hypha.git
cd hypha
mkdir build
cd build && cmake .. && make
To optionally install from scratch for Microsoft Visual Studio 2017,
type the following sequence of commands:
git clone https://github.com/simonzhang00/hypha.git
cd hypha
mkdir build
cd build
cmake -G "Visual Studio 15 Win64" ..
There should be a hypha.sln file in the build folder. Open hypha.sln in Microsoft Visual Studio 2017 and set Release
in the Solution Configurations drop down box and x64
in the Solution Platforms drop down box. Build the hypha.sln file in Visual Studio 2017 by right clicking Solution 'hypha'
from the Solution Explorer and selecting build. The executables should appear in the build/Release folder.
On Unix-based systems, after running make from the build folder, first untar the examples folder
cd ..
tar -xvf examples.tar.gz
cd build
then run the following command to run a computationally involved dataset:
./hypha --spectral_sequence ../examples/high_genus_extended.bin high_genus_extended-pairs.res
For Microsoft Visual Studio 2017, after building the executables in the build/Release folder, go to the build folder and execute the following commands to untar the examples folder
cd ..
tar -xvf examples.tar.gz
cd build/Release
then run the following command to run a computationally involved dataset:
./hypha --spectral_sequence ../../examples/high_genus_extended.bin high_genus_extended-pairs.res
On the 40 core supercomputing node with a Tesla V100 GPU that we ran experiments on, the above matrix reduction takes 1.91 seconds.
We have provided a couple of datasets ranging from small (single_triangle, neptune, Stanfordbunny) to computationally minimal (18-sphere) to computationally more involved datasets (torus, high_genus_extended). This characterization depends on the number of simplices, the computational difficulty of the "tail-columns", and, in general, the distribution of column additions across the boundary matrix discussed in our paper[8]
. One can obtain the number of simplices with the info
executable.
In general, to run in the build folder, type:
./hypha [options] inputfile outputfile
where inputfile and outputfile have paths relative to the build folder
options:
Usage: hypha [options] input_filename output_filename
Options:
--ascii -- use ascii file format
--binary -- use binary file format (default)
--help -- prints this screen
--verbose -- verbose is always on
--dual_input -- convert pairs to persistent homology pairs from reduction of a dualized input boundary matrix
--nocompression -- removes compression (may be beneficial with dualized input)
--vector_vector, --full_pivot_column, --sparse_pivot_column, --heap_pivot_column, --bit_tree_pivot_column
-- selects a representation data structure for boundary matrices (default is '--bit_tree_pivot_column')
--twist, --spectral_sequence
-- selects a final stage matrix reduction algorithm (default is '--twist')
CAUTION: if the wrong combination of options are typed, HYPHA's behavior will be undefined or it may recognize the right most option or will print the above help screen.
HYPHA is fully compatible with PHAT's input boundary matrix binary and ascii inputs as well as the binary and ascii persistence pairs output files. See [3]
https://github.com/blazs/phat for a description of the binary and ascii inputs.
See the PHAT project's benchmark datasets here:
https://drive.google.com/uc?id=0B7Yz6TPEpiGERGZFbjlXaUt1ZWM&export=download
To generate datasets see the project by Bauer, Kerber, Reininghaus:
[2]
https://github.com/DIPHA/dipha
Notice that DIPHA may require a lower version of openmpi than currently installed on the cluster. e.g. openmpi version 3.0.1
In the dipha/matlab folder, there are matlab scripts to create a .complex file
e.g. for a large dataset: create_smooth_image_data(100)
will generate smooth_100.complex
Then in the dipha/build folder, after successfully building the dipha cmake project, run ./create_phat_filtration ../matlab/inputfile.complex outputfile.bin
e.g. run ./create_phat_filtration ../matlab/smooth_100.complex smooth_100.bin
then move the .bin file to your examples folder under HYPHA
to generate anti-transposed (dualized) matrices [7]
, turn on the --dual option when running ./create_phat_filtration
e.g. run ./create_phat_filtration --dual ../matlab/smooth_100.complex smooth_100-DUAL.bin
To convert datasets from/to binary to/from ascii and with/without dualizing, use the convert executable available in the build folder upon successfully building.
e.g. run ./convert --binary --save-binary --dualize inputfile.bin inputfileDUAL.bin
Note: In special cases [4]
, reducing a dualized matrix can significantly reduce the amount of computation needed and thus may give better performance with the --nocompression option on. HYPHA performs compression [1]
by default.
Here is a brief list of related PH computation software packages: (ordered alphabetically)
@inproceedings{zhang2019hypha,
title={HYPHA: a framework based on separation of parallelisms to accelerate persistent homology matrix reduction},
author={Zhang, Simon and Xiao, Mengbai and Guo, Chengxin and Geng, Liang and Wang, Hao and Zhang, Xiaodong},
booktitle={Proceedings of the ACM International Conference on Supercomputing},
pages={69--81},
year={2019},
organization={ACM}
}
- U. Bauer, M. Kerber, J. Reininghaus: Clear and Compress: Computing Persistent Homology in Chunks. Topological methods in data analysis and visualization III. Springer, Cham, 2014. 103-117.
- U. Bauer, M. Kerber, J. Reininghaus: Distributed computation of persistent homology. Proceedings of the Sixteenth Workshop on Algorithm Engineering and Experiments (ALENEX), 2014.
- U. Bauer, M. Kerber, J. Reininghaus, H. Wagner: PHAT – Persistent Homology Algorithms Toolbox. Mathematical Software – ICMS 2014, Lecture Notes in Computer Science Volume 8592, 2014, pp 137-143
- U. Bauer, “Ripser/Ripser.” GitHub, 6 Oct. 2018, github.com/Ripser/ripser.
- C. Chen, M. Kerber: Persistent Homology Computation With a Twist. 27th European Workshop on Computational Geometry, 2011.
- H. Edelsbrunner, J. Harer: Computational Topology, An Introduction. American Mathematical Society, 2010, ISBN 0-8218-4925-5
- V. de Silva, D. Morozov, M. Vejdemo-Johansson: Dualities in persistent (co)homology. Inverse Problems 27, 2011
- S. Zhang, M. Xiao, C. Guo, L. Geng, H. Wang, X. Zhang: HYPHA: a Framework based on Separation of Parallelisms to Accelerate Persistent Homology Matrix Reduction. Proceedings of the 2019 International Conference on Supercomputing. ACM, 2019.