GTC 2024 Talk
#1410
Replies: 1 comment
-
I consider FlashAttention: Fast and Memory-Efficient Exact Attention With IO-Awareness [S62546] from @tridao on Mar 20 also a CUTLASS talk. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
There are two GTC talks in 2024
This year GTC is both in person and virtual. If you want to meet core CUTLASS developers in person, please let us know.
Beta Was this translation helpful? Give feedback.
All reactions