Skip to content

Commit

Permalink
typo
Browse files Browse the repository at this point in the history
  • Loading branch information
Michael Gschwind committed Apr 16, 2024
1 parent 740c15d commit 0711d94
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion quantize.py
Original file line number Diff line number Diff line change
Expand Up @@ -704,7 +704,7 @@ def linear_forward_int4(x, weight_int4pack, scales_and_zeros, out_features, grou
# print("MPS workaround active, will produce bogus results")
if "mps" in str(x.device):
new_shape = origin_x_size[:-1] + (out_features,)
return torch.zero(new_shape, dtype=x.dtype, device=x.device)
return torch.zeros(new_shape, dtype=x.dtype, device=x.device)

c = torch.ops.aten._weight_int4pack_mm(
x.to(torch.bfloat16), # TODO: should probably make a warning if x is not already bfloat16
Expand Down

0 comments on commit 0711d94

Please sign in to comment.