Skip to content

[CUDA] GroupQueryAttention operator using FlashAttention #1400

[CUDA] GroupQueryAttention operator using FlashAttention

[CUDA] GroupQueryAttention operator using FlashAttention #1400

Triggered via pull request October 3, 2023 17:20
Status Success
Total duration 18s
Artifacts
This run and associated checks have been archived and are scheduled for deletion. Learn more about checks retention
cuda build_x64_RelWithDebInfo
2s
cuda build_x64_RelWithDebInfo
dml build_x64_RelWithDebInfo
0s
dml build_x64_RelWithDebInfo
training build_x64_RelWithDebInfo
0s
training build_x64_RelWithDebInfo
kernelDocumentation build_x64_RelWithDebInfo
2s
kernelDocumentation build_x64_RelWithDebInfo
Fit to window
Zoom out
Zoom in