From ea1097952a1cb2f49b1ca5d15e3ee4aed99fea42 Mon Sep 17 00:00:00 2001
From: Alexander Kozlov
Date: Mon, 25 Nov 2024 10:52:33 +0400
Subject: [PATCH] Update docs/source/openvino/export.mdx

---
 docs/source/openvino/export.mdx | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/docs/source/openvino/export.mdx b/docs/source/openvino/export.mdx
index d374d38a2..118313999 100644
--- a/docs/source/openvino/export.mdx
+++ b/docs/source/openvino/export.mdx
@@ -88,7 +88,7 @@ Optional arguments:
                         weight compression is applied, they are compressed to INT8.
   --awq                 Whether to apply AWQ algorithm. AWQ improves generation quality of INT4-compressed LLMs, but
                         requires additional time for tuning weights on a calibration dataset. To run AWQ, please also
-                        provide a dataset argument. Note: it's possible that there will be no matching patterns in the
+                        provide a dataset argument. Note: it is possible that there will be no matching patterns in the
                         model to apply AWQ, in such case it will be skipped.
   --scale-estimation    Indicates whether to apply a scale estimation algorithm that minimizes the L2 error between
                         the original and compressed layers. Providing a dataset is required to run scale estimation.