Packed QKV and Rotary Embedding Support for sm<80 GQA (#20012) #25331
windows.yml
on: push
Windows-CUDA-12
52m 38s
Onnxruntime-TVM
1h 8m