Packed QKV and Rotary Embedding Support for sm<80 GQA #25313
windows.yml
on: pull_request
Windows-CUDA-12
53m 41s
Onnxruntime-TVM
1h 9m