Move attention test data to file (#17158)
(1) Move attention test data from code to a file to avoid a prefast crash, which blocks the Python packaging pipeline.
(2) Enable some test cases that were previously disabled on Windows.
(3) Fix an assertion error in `MultiHeadAttentionTest.CrossAttention_WithPastPassedInDirectly_NoMask`. This test case covers Whisper cross attention. When memory efficient attention was enabled, the format was converted to BNSH, which triggered an assertion error because memory efficient attention asserts BSNH format. Memory efficient attention is temporarily disabled for this case. The test itself is also disabled, since Whisper does not use it anymore and it fails on ROCm.
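
To illustrate the layout mismatch behind the assertion error, here is a minimal sketch of a BSNH to BNSH conversion. It assumes the usual reading of the abbreviations (B = batch size, S = sequence length, N = number of heads, H = head size). `TransposeBsnhToBnsh` is a hypothetical helper written for this note, not a function from the repository.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper (not part of the repository): copies a tensor stored in
// BSNH order (batch, sequence, heads, head_size) into BNSH order
// (batch, heads, sequence, head_size). Both buffers are flat and row-major.
std::vector<float> TransposeBsnhToBnsh(const std::vector<float>& input,
                                       std::size_t batch, std::size_t seq,
                                       std::size_t heads, std::size_t head_size) {
  assert(input.size() == batch * seq * heads * head_size);
  std::vector<float> output(input.size());
  for (std::size_t b = 0; b < batch; ++b) {
    for (std::size_t s = 0; s < seq; ++s) {
      for (std::size_t n = 0; n < heads; ++n) {
        for (std::size_t h = 0; h < head_size; ++h) {
          // Source offset: S varies before N (BSNH).
          const std::size_t src = ((b * seq + s) * heads + n) * head_size + h;
          // Destination offset: N varies before S (BNSH).
          const std::size_t dst = ((b * heads + n) * seq + s) * head_size + h;
          output[dst] = input[src];
        }
      }
    }
  }
  return output;
}
```

The two layouts hold the same elements in a different order, so a kernel that expects BSNH cannot safely consume a buffer that has already been converted to BNSH. That is consistent with the assertion described above.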
Showing 11 changed files with 5,739 additions and 6,472 deletions.