[QNN EP/Quantization] Add MinimumRealRange extra option to quantization script #18278

adrianlizarraga · 2023-11-03T22:58:11Z

Description

Adds the extra option MinimumRealRange to the quantization script:

"""
MinimumRealRange= float|None :
                    Default is None. If set to a floating-point value, the calculation of the quantization parameters
                    (i.e., scale and zero point) will enforce a minimum range between rmin and rmax. If (rmax - rmin)
                    is less than the specified minimum range, rmax will be set to rmin + MinimumRealRange. This is
                    necessary for EPs like QNN that require a minimum floating-point range when determining
                    quantization parameters.
"""
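The effect of the option on the scale and zero point can be illustrated with a minimal sketch. The function name `compute_scale_zp_sketch`, its signature, the zero-inclusion step, and the degenerate-range fallback are assumptions made for illustration; this is not the code from `quantize.py`.

```python
def compute_scale_zp_sketch(rmin, rmax, qmin, qmax, min_real_range=None):
    """Illustrative only: asymmetric scale/zero-point with an optional minimum real range."""
    # Assumption: the real range is widened to include 0 so that 0.0 is exactly representable.
    rmin = min(rmin, 0.0)
    rmax = max(rmax, 0.0)

    # Behaviour described in the option text: if (rmax - rmin) is smaller than
    # the requested minimum, rmax is raised to rmin + min_real_range.
    if min_real_range is not None:
        rmax = max(rmax, rmin + min_real_range)

    scale = (rmax - rmin) / float(qmax - qmin)
    if scale == 0.0:
        # Assumption: a degenerate (zero-width) range falls back to scale 1, zero point 0.
        return 0, 1.0

    zero_point = round(qmin - rmin / scale)
    return zero_point, scale


# A constant-zero tensor quantized to uint8, with and without the option.
zp_default, scale_default = compute_scale_zp_sketch(0.0, 0.0, 0, 255)
zp_min, scale_min = compute_scale_zp_sketch(0.0, 0.0, 0, 255, min_real_range=0.0001)
print(zp_default, scale_default)
print(zp_min, scale_min)
```

With the option set, the degenerate range is stretched to 0.0001, so the scale becomes 0.0001 / 255 instead of hitting the fallback. Ranges already wider than the minimum are left unchanged.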

Motivation and Context

QNN requires a minimum floating-point range of 0.0001.
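As a rough usage sketch, the option would be passed through the `extra_options` dictionary of the quantization entry point. The helper name `quantize_for_qnn`, the model paths, and the calibration data reader are hypothetical placeholders; only the option name and the 0.0001 value come from this PR.

```python
# Extra options for the quantization script. The key and value come from the
# PR description: QNN needs a minimum floating-point range of 0.0001.
QNN_EXTRA_OPTIONS = {"MinimumRealRange": 0.0001}


def quantize_for_qnn(model_input, model_output, calibration_data_reader):
    """Hypothetical helper: run static quantization with the minimum range enforced."""
    # Imported lazily so the options above can be inspected without onnxruntime installed.
    from onnxruntime.quantization import quantize_static

    quantize_static(
        model_input,               # hypothetical path, e.g. "model.onnx"
        model_output,              # hypothetical path, e.g. "model.qdq.onnx"
        calibration_data_reader,   # a user-supplied CalibrationDataReader instance
        extra_options=QNN_EXTRA_OPTIONS,
    )
```

Leaving the option unset (the default, `None`) keeps the previous behaviour, so models targeting other execution providers are unaffected.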

onnxruntime/python/tools/quantization/quantize.py

Signed-off-by: adrianlizarraga <[email protected]>

yufenglee


adrianlizarraga added 6 commits November 3, 2023 15:52

Add QuantMinRealRange to quantization script

e71f9fc

Add unit test for compute_scale_zp()

08352b9

Add compute_scale_zp unit test for 16-bit qmin/qmax

b9bebda

Add unit test for QuantMinRealRange extra option

e345f80

Add test comment

ebd44ae

Revert warning fix

ff790e0

adrianlizarraga marked this pull request as ready for review November 4, 2023 01:23

adrianlizarraga requested review from yufenglee, HectorSVC and jywu-msft November 4, 2023 01:23

adrianlizarraga added 4 commits November 3, 2023 18:29

Edit function comments

c9c39b5

Merge commits from main

d690304

Run lintrunner

7ba1921

Merge latest commits from main branch

e1f7a1d

adrianlizarraga commented Nov 7, 2023

onnxruntime/python/tools/quantization/quantize.py (outdated)

adrianlizarraga added 3 commits November 8, 2023 09:46

Rename option to MinimumRealRange

3975ace

Signed-off-by: adrianlizarraga <[email protected]>

Rename test file for MinimumRealRange

cf7a075

Signed-off-by: adrianlizarraga <[email protected]>

Merge latest commits from the main branch

71904d1

adrianlizarraga changed the title ~~[QNN EP/Quantization] Add QuantMinRealRange extra option to quantization script~~ [QNN EP/Quantization] Add MinimumRealRange extra option to quantization script Nov 8, 2023

Run lintrunner

2da0323

yufenglee approved these changes Nov 8, 2023

Merge latest code from main

c8f58d8

adrianlizarraga merged commit f237b0b into main Nov 9, 2023
91 checks passed

adrianlizarraga deleted the adrianl/quantization-min-real-range branch November 9, 2023 18:55
