Commit

Update quantization.md
Minor typo fix.
Quantization method name: GTPQ -> GPTQ
yy-shin authored Sep 25, 2024
1 parent bc1bce9 commit 5fe9c11
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion docs/performance/model-optimizations/quantization.md
@@ -282,7 +282,7 @@ quant.model.save_model_to_file(

```

-For AWQ and GTPQ quantization usage, please refer to [Gen-AI model builder](https://github.com/microsoft/onnxruntime-genai/tree/main/src/python/py/models#quantized-pytorch-model).
+For AWQ and GPTQ quantization usage, please refer to [Gen-AI model builder](https://github.com/microsoft/onnxruntime-genai/tree/main/src/python/py/models#quantized-pytorch-model).

## FAQ
### Why am I not seeing performance improvements?