Packed QKV and Rotary Embedding Support for sm<80 GQA #20012
Merged
Azure Pipelines / Linux CPU CI Pipeline (x64 Linux_Debug)
succeeded
Mar 23, 2024 in 31m 6s
x64 Linux_Debug succeeded
Loading