Commit

Update README.md
fix typo
mikekgfb authored Apr 5, 2024
1 parent ea4d370 commit 4491dd9
Showing 1 changed file with 1 addition and 0 deletions.
1 change: 1 addition & 0 deletions README.md
@@ -363,6 +363,7 @@ python export.py --checkpoint-path ${MODEL_PATH} -d fp32 --quant "{'linear:int8'
Now you can run your model with the same command as before:
```
python generate.py --pte-path ${MODEL_DIR}/${MODEL_NAME}_int8-gw256.so --checkpoint-path ${MODEL_PATH} -d fp32 --prompt "Hello my name is"
```

Please note that group-wise quantization works functionally, but has
not been optimized for CUDA and CPU targets where the best