[CUDA] GroupQueryAttention operator using FlashAttention #17988
Triggered via pull request
October 4, 2023 15:32
Status
Success
Total duration
2h 17m 34s
Artifacts
–
windows.yml
on: pull_request
Windows-CUDA-12
23m 52s
Onnxruntime-TVM
2h 17m