Is there any way to convert a qdqmodel to qlinearmodel use ort? #18511

Rickustc · 2023-11-20T10:34:18Z

Describe the issue

I have a qdqmodel which weight in operator actually be int 8

is there any setting in ort.seesion that can make qdq into qlinear mode? like this:

v

To reproduce

btw it is a resnet50 :D

Urgency

No response

Platform

Linux

OS Version

latest

ONNX Runtime Installation

Built from Source

ONNX Runtime Version or Commit ID

latest

ONNX Runtime API

Python

Architecture

X64

Execution Provider

Default CPU

Execution Provider Library Version

No response

yufenglee · 2023-11-20T16:35:39Z

You can try setting SessionOption's optimized_model_filepath to the destination path and graph_optimization_level to onnxruntime.GraphOptimizationLevel.ORT_ENABLE_EXTENDED and load the model, then the optimized model will be saved to destination path. However, it is possible that not all QDQ are fused into QLinear.

Rickustc · 2023-11-30T08:37:28Z

@yufenglee thank you for reply！ Is there any way to control single operator weather it is converted to QLinear op?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Is there any way to convert a qdqmodel to qlinearmodel use ort? #18511

Is there any way to convert a qdqmodel to qlinearmodel use ort? #18511

Rickustc commented Nov 20, 2023

yufenglee commented Nov 20, 2023

Rickustc commented Nov 30, 2023

Is there any way to convert a qdqmodel to qlinearmodel use ort? #18511

Is there any way to convert a qdqmodel to qlinearmodel use ort? #18511

Comments

Rickustc commented Nov 20, 2023

Describe the issue

To reproduce

Urgency

Platform

OS Version

ONNX Runtime Installation

ONNX Runtime Version or Commit ID

ONNX Runtime API

Architecture

Execution Provider

Execution Provider Library Version

yufenglee commented Nov 20, 2023

Rickustc commented Nov 30, 2023