Add ModelProto support for quantize api #20018

Merged: 8 commits merged into microsoft:main on Mar 27, 2024

Conversation

xiaoyu-work (Contributor)

Description

Add `ModelProto` support for the `quantize` API

Motivation and Context

Currently, the `quantize` API only accepts a model path as the input model. However, for large models, saving and loading from disk can be time-consuming. By adding `ModelProto` as an input option to the `quantize` API, significant time can be saved.
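
As an illustration, here is a minimal sketch of the new calling pattern, assuming the change applies to `quantize_dynamic` as one of the `quantize` entry points and that `model_input` now accepts an `onnx.ModelProto` in addition to a file path (the file names below are hypothetical):

```python
import onnx
from onnxruntime.quantization import QuantType, quantize_dynamic

# Load (or build/transform) the model in memory once.
model = onnx.load("model.onnx")  # hypothetical input path

# Pass the in-memory ModelProto directly instead of a file path,
# skipping a save/load round trip that is costly for large models.
quantize_dynamic(
    model_input=model,                # previously restricted to a str/Path
    model_output="model.quant.onnx",  # hypothetical output path
    weight_type=QuantType.QUInt8,
)
```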

@tianleiwu (Contributor)

/azp run Windows ARM64 QNN CI Pipeline,Windows x64 QNN CI Pipeline,Windows CPU CI Pipeline,Windows GPU CI Pipeline,Windows GPU TensorRT CI Pipeline,ONNX Runtime Web CI Pipeline,Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline,Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline

@tianleiwu (Contributor)

/azp run Linux OpenVINO CI Pipeline,Linux QNN CI Pipeline,MacOS CI Pipeline,orttraining-amd-gpu-ci-pipeline,orttraining-linux-ci-pipeline,orttraining-linux-gpu-ci-pipeline,orttraining-ortmodule-distributed,onnxruntime-binary-size-checks-ci-pipeline,Big Models,Android CI Pipeline

@tianleiwu (Contributor)

/azp run iOS CI Pipeline,ONNX Runtime React Native CI Pipeline

Azure Pipelines successfully started running 2 pipeline(s).

Azure Pipelines successfully started running 10 pipeline(s).

1 similar comment

Azure Pipelines successfully started running 10 pipeline(s).


justinchuby merged commit c8676ff into microsoft:main Mar 27, 2024
80 of 82 checks passed
TedThemistokleous pushed a commit to TedThemistokleous/onnxruntime that referenced this pull request May 7, 2024