-
Notifications
You must be signed in to change notification settings - Fork 0
OpenCL GPU support
Gpu_stanmathcl branch of this library provides GPU support. It is based on OpenCL, an open-source framework for writing programs that utilize a platform with heterogeneous hardware. OpenCL is used to speed-up time consuming routines used by Stan. Currently, matrix multiplication and Cholesky decomposition and their derivatives are supported. Other routines will be available in the future. These routines are suitable for programs which require solving large NxM
matrices (N>600
) such as algorithms that utilize large covariance matrices.
Users must have suitable hardware (e.g. Nvidia or AMD gpu), valid OpenCL driver, SDK and a suitable C/C++ compiler installed on their computer.
#Installation
##Linux The following guide is for Ubuntu, but it should be similar for any other Linux distribution. You should have the GNU compiler suite or clang compiler installed beforehand.
Install the Nvidia CUDA toolkit and clinfo tool if you have a Nvidia GPU
apt update
apt install nvidia-cuda-toolkit clinfo
Those with AMD devices can install the OpenCL driver available through
apt install -y libclc-amdgcn mesa-opencl-icd clinfo
If your device is not supported by the current drivers available you can try Paulo Miguel PPA
add-apt-repository ppa:paulo-miguel-dias/mesa
apt-get update
apt-get install libclc-amdgcn mesa-opencl-icd
##MacOS Mac's should already have the OpenCL driver installed if you have the appropriate hardware.
##Windows Install the latest Rtools suite if you don't already have it. During the installation make sure that the 64 bit toolchain is installed.
If you have a Nvidia card, install the latest Nvidia CUDA toolkit. AMD users should use AMD APP SDK
Users can check that their installation is valid by running clinfo.
To turn on GPU computation:
- Check and record what device and platform you would like to use with clinfo and record the device and platform index.
- Open a text file called make/local; if it does not exist, create one.
- Add these lines to the make/local file:
STAN_OPENCL=true
OPENCL_DEVICE_ID=${CHOSEN_INDEX}
OPENCL_PLATFORM_ID=${CHOSEN_INDEX}
where the user will replace ${CHOSEN_INDEX} with the index of the device and platform they would like to use. If you are on Windows append the following lines at the end of the make/local file in order to link with the appropriate OpenCL library:
Nvidia `` CC = g++ LDFLAGS_OPENCL= -L"$(CUDA_PATH)lib\x64" -lOpenCL
AMD
CC = g++ LDFLAGS_OPENCL= -L"$(AMDAPPSDKROOT)lib\x64" -lOpenCL
# Running Tests with OpenCL
Once you have done the above step, `runTests.py` should execute with the GPU enabled. All tests will match the phrase `*_opencl_*` and tests can be filtered such as
```bash
./runTests.py test/unit -f opencl
TODO