Skip to content

Commit

Permalink
[WebGPU EP] Support GroupQueryAttention (microsoft#22658)
Browse files Browse the repository at this point in the history
### Description
<!-- Describe your changes. -->
Support GroupQueryAttention operator for native webgpu ep.


### Motivation and Context
<!-- - Why is this change required? What problem does it solve?
- If it fixes an open issue, please link to the issue here. -->
This is required for inferencing some LLMs.
  • Loading branch information
satyajandhyala authored and ankitm3k committed Dec 11, 2024
1 parent 3828c33 commit 30f8e7b
Show file tree
Hide file tree
Showing 3 changed files with 29 additions and 499 deletions.
Loading

0 comments on commit 30f8e7b

Please sign in to comment.