-
Beta Was this translation helpful? Give feedback.
Answered by
hwu36
Nov 16, 2022
Replies: 1 comment 6 replies
-
You are very good! cublas has 2 in its grid.z which means it uses splitK. To do apple to apple comparison, you need to do the same. Note, different version of cutlass and compiler will cause some performance difference. |
Beta Was this translation helpful? Give feedback.
6 replies
Answer selected by
MARD1NO
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
You are very good!
cublas has 2 in its grid.z which means it uses splitK. To do apple to apple comparison, you need to do the same.
Note, different version of cutlass and compiler will cause some performance difference.