-
Notifications
You must be signed in to change notification settings - Fork 295
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Fix colfax_cutlass flash_attention operator (#2401)
Summary: colfax_cutlass kernels will fail because of C++ template instantiation. We need to explicitly include the header file to instantiate all template parameters. Pull Request resolved: #2401 Test Plan: Install the colfax_cutlass operators: ``` python install.py --userbenchmark triton --cutlass /home/xz/git/benchmark/submodules/cutlass-kernels/src/fmha/fmha_forward.cu(826): warning #117: non-void function "main" should return a value return; ^ Remark: The warnings can be suppressed with "-diag-suppress <warning-number>" /home/xz/git/benchmark/submodules/cutlass-kernels/src/fmha/fmha_forward.cu(826): warning #117: non-void function "main" should return a value return; ^ Remark: The warnings can be suppressed with "-diag-suppress <warning-number>" ``` Run the flash_attention operator from colfax_cutlass ``` python run_benchmark.py triton --op flash_attention --only colfax_cutlass --num-inputs 1 (Batch, Heads, SeqLen, Dhead) colfax_cutlass-latency ------------------------------- ------------------------ (32, 32, 512, 64) 0.001024 ``` Reviewed By: manman-ren Differential Revision: D60557212 Pulled By: xuzhao9 fbshipit-source-id: 25b216f850d2e82815041059d372627806bfd3ca
- Loading branch information
1 parent
f4ed185
commit 0a2ff22
Showing
4 changed files
with
22 additions
and
50 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
24 changes: 0 additions & 24 deletions
24
userbenchmark/triton/cutlass_kernels/include/fmha_forward.h
This file was deleted.
Oops, something went wrong.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters