Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Fix MoE tensor parallelism tests (#20147)
### Description <!-- Describe your changes. --> Previously the expert weights are in row-major. But with the updated cutlass extension introduced by #20108, weights are stored in col-major that aligns with Pytorch implementation. This change fixes the way the tensors are sliced across shards. ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. -->
- Loading branch information