Packed QKV and Rotary Embedding Support for sm<80 GQA #20012
Merged
Azure Pipelines / Windows CPU CI Pipeline (x64_debug build_x64_debug)
succeeded
Mar 23, 2024 in 31m 19s
x64_debug build_x64_debug succeeded
Loading