Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Small update to gpu compiler flags to support compute capabilities 8.0 and 8.6 as well. #113

Merged

Conversation

Subhajit-Roy-Partho
Copy link
Contributor

This adds compiler flags for Ampere gpu with compute capabilities 8.0 and 8.6 as provided in https://github.com/Kitware/CMake/blob/master/Modules/FindCUDA/select_compute_arch.cmake.

@lorenzo-rovigatti
Copy link
Owner

Thanks! Those are some extensive changes. Did you change the file yourself or you used some other file (e.g. by NVIDIA) as template?

@Subhajit-Roy-Partho
Copy link
Contributor Author

Apologies for my delayed reply. It is mostly copied from the FindCUDA module of the CMAKE github and as given by nvidia cuda documentation. I didn't write it. I tried it out on our A100 gpu which are with compute capabilities 8.0. On average it only speed up by 1.01 using the above code.

@lorenzo-rovigatti
Copy link
Owner

Some of the changes I don't understand. Does it work on older architectures as well?

@Subhajit-Roy-Partho
Copy link
Contributor Author

The oldest GPU I have access to is GTX1080 and it works on it. But not sure about even older gpus.

@lorenzo-rovigatti lorenzo-rovigatti merged commit 21d7e48 into lorenzo-rovigatti:master Aug 6, 2024
1 check passed
lorenzo-rovigatti added a commit that referenced this pull request Aug 30, 2024
The bug would prevent compilation with `-DCUDA=On -D -DCUDA_COMMON_ARCH=OFF`
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants