[QNN Quant] Ensure 16bit tensor quant overrides set MS domain #19684

adrianlizarraga · 2024-02-28T01:19:51Z

Description

Ensures that DQ and Q ops use the msft domain if tensor quantization overrides specify 16-bit integer types.

Motivation and Context

ONNX does not yet support 16bit integer types for QuantizeLinear and DequantizeLinear ops (coming soon). For now, DQ/Q ops must use the MSFT domain.

We have to also check if tensor quantization overrides force the use of 16-bit quantization types. If so, we must correctly set the domain for Q/DQ ops.

onnxruntime/python/tools/quantization/onnx_quantizer.py

snnn · 2024-02-28T01:40:56Z

/azp run Windows GPU TensorRT CI Pipeline

azure-pipelines · 2024-02-28T01:41:04Z

Azure Pipelines successfully started running 1 pipeline(s).

…rides

…oft#19684) ### Description Ensures that DQ and Q ops use the msft domain if tensor quantization overrides specify 16-bit integer types. ### Motivation and Context ONNX does not yet support 16bit integer types for QuantizeLinear and DequantizeLinear ops (coming soon). For now, DQ/Q ops must use the MSFT domain. We have to also check if tensor quantization overrides force the use of 16-bit quantization types. If so, we must correctly set the domain for Q/DQ ops.

Ensure 16bit tensor quant overrides set MS domain

3aaa783

adrianlizarraga commented Feb 28, 2024

View reviewed changes

onnxruntime/python/tools/quantization/onnx_quantizer.py Show resolved Hide resolved

adrianlizarraga requested a review from xadupre February 28, 2024 01:23

adrianlizarraga marked this pull request as ready for review February 28, 2024 01:23

Merge branch 'main' into adrianl/set-ms-domain-from-tensor-quant-over…

3b3468c

…rides

adrianlizarraga requested review from jywu-msft and HectorSVC February 28, 2024 16:33

xadupre approved these changes Feb 28, 2024

View reviewed changes

Merge latest main commits to fix pipeline

a5140c0

adrianlizarraga merged commit c1bf7fc into main Feb 29, 2024
88 of 91 checks passed

adrianlizarraga deleted the adrianl/set-ms-domain-from-tensor-quant-overrides branch February 29, 2024 09:19

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[QNN Quant] Ensure 16bit tensor quant overrides set MS domain #19684

[QNN Quant] Ensure 16bit tensor quant overrides set MS domain #19684

adrianlizarraga commented Feb 28, 2024 •

edited

Loading

snnn commented Feb 28, 2024

azure-pipelines bot commented Feb 28, 2024

[QNN Quant] Ensure 16bit tensor quant overrides set MS domain #19684

[QNN Quant] Ensure 16bit tensor quant overrides set MS domain #19684

Conversation

adrianlizarraga commented Feb 28, 2024 • edited Loading

Description

Motivation and Context

snnn commented Feb 28, 2024

azure-pipelines bot commented Feb 28, 2024

adrianlizarraga commented Feb 28, 2024 •

edited

Loading