Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[QNN Quant] Ensure 16bit tensor quant overrides set MS domain #19684

Merged

Conversation

adrianlizarraga
Copy link
Contributor

@adrianlizarraga adrianlizarraga commented Feb 28, 2024

Description

Ensures that DQ and Q ops use the msft domain if tensor quantization overrides specify 16-bit integer types.

Motivation and Context

ONNX does not yet support 16bit integer types for QuantizeLinear and DequantizeLinear ops (coming soon). For now, DQ/Q ops must use the MSFT domain.

We have to also check if tensor quantization overrides force the use of 16-bit quantization types. If so, we must correctly set the domain for Q/DQ ops.

@adrianlizarraga adrianlizarraga marked this pull request as ready for review February 28, 2024 01:23
@snnn
Copy link
Member

snnn commented Feb 28, 2024

/azp run Windows GPU TensorRT CI Pipeline

Copy link

Azure Pipelines successfully started running 1 pipeline(s).

@adrianlizarraga adrianlizarraga merged commit c1bf7fc into main Feb 29, 2024
88 of 91 checks passed
@adrianlizarraga adrianlizarraga deleted the adrianl/set-ms-domain-from-tensor-quant-overrides branch February 29, 2024 09:19
zz002 pushed a commit to zz002/onnxruntime that referenced this pull request Mar 7, 2024
…oft#19684)

### Description
Ensures that DQ and Q ops use the msft domain if tensor quantization
overrides specify 16-bit integer types.

### Motivation and Context
ONNX does not yet support 16bit integer types for QuantizeLinear and
DequantizeLinear ops (coming soon). For now, DQ/Q ops must use the MSFT
domain.

We have to also check if tensor quantization overrides force the use of
16-bit quantization types. If so, we must correctly set the domain for
Q/DQ ops.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants