Commit

tune workgroup size
qjia7 committed Oct 12, 2024
1 parent 6f9845d commit ed571b6
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions js/web/lib/wasm/jsep/webgpu/ops/matmulnbits.ts
@@ -288,8 +288,8 @@ export const createMatMulNBitsBlockSize32ProgramInfo = (
const components = 1;
const outputShape = batchDims.concat([dimAOuter, dimBOuter]);

- const workgroupSize = 64;
- const workgroupY = 4;
+ const workgroupSize = 128;
+ const workgroupY = 8;
const workgroupX = workgroupSize / workgroupY;
const tileSize = workgroupX * bComponents * 8; // each uint32 has 8 data.
const aLengthPerTile = tileSize / aComponents;
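
A note on what the change does to the derived constants: since `workgroupX = workgroupSize / workgroupY`, going from 64/4 to 128/8 leaves `workgroupX` at 16, so `tileSize` and `aLengthPerTile` should stay the same and only the Y extent of the workgroup doubles. The sketch below checks that arithmetic. `deriveTiling` is a hypothetical helper, not part of `matmulnbits.ts`, and the values `aComponents = 4` and `bComponents = 4` are assumptions chosen for illustration, because the hunk does not show how they are set.

```typescript
interface Tiling {
  workgroupX: number;
  tileSize: number;
  aLengthPerTile: number;
}

// Hypothetical helper that mirrors the three derived constants in the hunk.
const deriveTiling = (
  workgroupSize: number,
  workgroupY: number,
  aComponents: number,
  bComponents: number,
): Tiling => {
  const workgroupX = workgroupSize / workgroupY;
  const tileSize = workgroupX * bComponents * 8; // each uint32 has 8 data.
  const aLengthPerTile = tileSize / aComponents;
  return { workgroupX, tileSize, aLengthPerTile };
};

// Assumed component counts; the real values are computed outside this hunk.
const aComponents = 4;
const bComponents = 4;

const before = deriveTiling(64, 4, aComponents, bComponents);
const after = deriveTiling(128, 8, aComponents, bComponents);

console.log('before', before);
console.log('after', after);
```

Under these assumptions both configurations give the same tiling, which suggests the commit tunes how many threads share a tile along Y rather than the tile width itself.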