[backend] Fix improper mma->dot shortcut when warpsPerCTA[1] > 1 #4803

Merged
merged 1 commit into from
Sep 25, 2024

Collaborator

@chsigg chsigg commented Sep 25, 2024

triton-lang#4492 introduced a regression where chained MMAs on Hopper segfault with 8 warps. It seems this case was previously guarded by a check, but the check was removed in #4492 even though the case is still unsupported.

Restoring the check means the conversion for these MMAs has to go through shared memory again, but that is better than segfaulting until the shortcut is actually supported.

Resolves openxla/xla#17356
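
For readers unfamiliar with the guard being restored, here is a minimal standalone sketch of the idea. It is not Triton's actual code: the function name `canUseMmaToDotShortcut` and the plain `std::array` layout are hypothetical stand-ins, and the real predicate checks further conditions that are omitted here.

```cpp
#include <array>

// Hypothetical, simplified illustration; not the real Triton predicate.
// warpsPerCTA[0] / warpsPerCTA[1]: number of warps along each of the two
// dimensions of the MMA result tile.
//
// Returns true only when the register-level mma->dot shortcut may be taken.
// When it returns false, the conversion has to go through shared memory.
bool canUseMmaToDotShortcut(const std::array<unsigned, 2> &warpsPerCTA) {
  // The shortcut is only treated as valid when there is a single warp
  // along dimension 1; warpsPerCTA[1] > 1 is the unsupported case.
  return warpsPerCTA[1] == 1;
}
```

Under these assumptions, a layout such as `{8, 1}` would keep the shortcut, while `{4, 2}` would be rejected and take the shared-memory path instead of hitting the unsupported case.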

Contributor

Jokeren commented Sep 25, 2024

Can you copy & paste me the mma->mma instruction and the layouts?

@Jokeren Jokeren self-assigned this Sep 25, 2024
Collaborator Author

chsigg commented Sep 25, 2024

> Can you copy & paste me the mma->mma instruction and the layouts?

Does openxla/xla#17356 (comment) contain what you need?

Contributor

Jokeren commented Sep 25, 2024

Oh, I think it's mma->dot instead of mma->mma. It's not supposed to be handled by my previous PR. Why did I take it out...I need some further investigation before getting back to you.

Contributor

Jokeren commented Sep 25, 2024

I agree with the fix. I actually didn't take out the getWarpsPerCTA check, but the previous name, mmaToMmaShortcut, is a misnomer.
@chsigg How about changing the title to:

Fix improper mma->dot shortcut when warpsPerCTA[1] > 1

@chsigg chsigg changed the title from "[backend] Don't segfault on unsupported mma->mma" to "[backend] Fix improper mma->dot shortcut when warpsPerCTA[1] > 1" on Sep 25, 2024
@Jokeren Jokeren merged commit 694719a into triton-lang:main Sep 25, 2024
7 checks passed
@vwbaker vwbaker deleted the warps branch September 26, 2024 07:43
Development

Successfully merging this pull request may close these issues.

Pallas/Triton segfault on H100
3 participants