Skip to content

[CUDA] GroupQueryAttention operator using FlashAttention #17988

[CUDA] GroupQueryAttention operator using FlashAttention

[CUDA] GroupQueryAttention operator using FlashAttention #17988

Triggered via pull request October 4, 2023 15:32
Status Success
Total duration 2h 17m 34s
Artifacts
This run and associated checks have been archived and are scheduled for deletion. Learn more about checks retention

windows.yml

on: pull_request
Fit to window
Zoom out
Zoom in