Skip to content

[CUDA] GroupQueryAttention operator using FlashAttention #19106

[CUDA] GroupQueryAttention operator using FlashAttention

[CUDA] GroupQueryAttention operator using FlashAttention #19106

Triggered via pull request October 4, 2023 15:32
Status Success
Total duration 1h 39m 25s
Artifacts
This run and associated checks have been archived and are scheduled for deletion. Learn more about checks retention

linux.yml

on: pull_request
Onnxruntime-TVM
1h 39m
Onnxruntime-TVM
Fit to window
Zoom out
Zoom in