Fix build break with cuda 12.2 #17922

Merged
yufenglee merged 1 commit into main from yufeng/build_break_122
Oct 13, 2023

Conversation

yufenglee
Member

Description

nvcc 12.2 crashes while building onnxruntime/onnxruntime/contrib_ops/cuda/bert/flash_attention/flash_fwd_* for SM < 8.0. nvcc 11.8 works, though, so this is most likely a bug in nvcc 12.2.

This PR excludes FlashAttention from the build for architectures below SM 8.0 (arch < 800).
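
The diff itself is not shown in this conversation, so the following is only a minimal sketch of how an architecture-based exclusion of this kind could look, not the change that was merged. `SKETCH_CUDA_ARCH`, `kMinFlashArch`, `flash_attention_enabled`, `flash_fwd_body_compiled`, and `check_guard_consistency` are hypothetical names. `SKETCH_CUDA_ARCH` stands in for nvcc's `__CUDA_ARCH__` macro so that the sketch also compiles as plain host C++.

```cpp
#include <cassert>

// Hypothetical stand-in for nvcc's __CUDA_ARCH__ (750 corresponds to SM 7.5).
// Under nvcc, the real macro is only defined during device compilation.
#ifndef SKETCH_CUDA_ARCH
#define SKETCH_CUDA_ARCH 750
#endif

// Threshold taken from the description: FlashAttention is skipped for arch < 800.
constexpr int kMinFlashArch = 800;

// Compile-time predicate: is FlashAttention built for this architecture value?
constexpr bool flash_attention_enabled(int arch) { return arch >= kMinFlashArch; }

// Hypothetical kernel wrapper. The body only exists when the target architecture
// is new enough, so the compiler never processes it for older targets.
inline bool flash_fwd_body_compiled() {
#if SKETCH_CUDA_ARCH >= 800
  return true;   // the real kernel body would go here
#else
  return false;  // body compiled out for older architectures
#endif
}

// Sanity check tying the preprocessor guard to the constexpr predicate.
inline void check_guard_consistency() {
  assert(flash_fwd_body_compiled() == flash_attention_enabled(SKETCH_CUDA_ARCH));
}
```

With a guard like this, the compiler does not see the kernel body for older targets. Callers still need a fallback path on devices where the kernel is absent.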

@snnn snnn requested a review from jchen351 October 12, 2023 21:06
jchen351 previously approved these changes Oct 12, 2023
aciddelgado previously approved these changes Oct 12, 2023
@yufenglee
Member Author

Please don't merge the PR. It fails a local test.

@yufenglee yufenglee dismissed stale reviews from aciddelgado and jchen351 via 8f3a3ae October 13, 2023 05:53
@yufenglee yufenglee force-pushed the yufeng/build_break_122 branch from 6b213eb to 8f3a3ae Compare October 13, 2023 05:53
@yufenglee yufenglee merged commit 7551dd0 into main Oct 13, 2023
91 checks passed
@yufenglee yufenglee deleted the yufeng/build_break_122 branch October 13, 2023 17:21
jchen351 pushed a commit that referenced this pull request Oct 18, 2023
@faxu faxu added the triage:approved (Approved for cherrypicks for release) and sdxl_llama labels Oct 25, 2023
tianleiwu pushed a commit that referenced this pull request Oct 31, 2023
@tianleiwu tianleiwu removed the triage:approved (Approved for cherrypicks for release) and release:1.16.2 labels Nov 1, 2023
kleiti pushed a commit to kleiti/onnxruntime that referenced this pull request Mar 22, 2024
6 participants