build failure with nccl #121
Comments
This should be fixed in #130.
This came up while trying to update the spack package to 2.6.2 and add NCCL/RCCL support, but it doesn't look as if it's related to spack. The build fails when I enable NCCL but works without it; I'm puzzled why, since building with NCCL must usually work.
The cmake args which fail (with openmpi-4.1.4, cuda-11.4.1, nccl-2.14.3-1) are
It succeeds when -DCOSMA_WITH_NCCL=ON is removed.
There are two different failures, depending on whether openmpi is built with C++ support.
With openmpi+cxx, the failure is
and without cxx it's
Separately, what exactly does COSMA_WITH_GPU_AWARE_MPI mean? In the case of openmpi, it could mean configuring with --with-cuda, using a UCX built with cuda and/or gdrcopy, or both.
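For context on the openmpi side of that question, here is a minimal sketch of how one might check whether an Open MPI installation was itself built with CUDA support. It relies on the `mpi_built_with_cuda_support` parameter reported by `ompi_info`; the helper name `ompi_cuda_support` is made up for illustration. It says nothing about whether UCX was built with cuda or gdrcopy, and it is only an assumption that this is the property COSMA_WITH_GPU_AWARE_MPI expects.

```shell
# Hypothetical helper: reads `ompi_info --parsable --all` output on stdin
# and succeeds only if Open MPI reports that it was built with CUDA support.
ompi_cuda_support() {
    grep -q 'mpi_built_with_cuda_support:value:true'
}

# Only query ompi_info if it is actually on PATH.
if command -v ompi_info >/dev/null 2>&1; then
    if ompi_info --parsable --all | ompi_cuda_support; then
        echo "openmpi: built with CUDA support"
    else
        echo "openmpi: no CUDA support reported"
    fi
fi
```

A UCX-based setup can still move GPU buffers even when this reports no CUDA support in openmpi itself, so a negative result here is not conclusive.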