You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
You can try setting SessionOption's optimized_model_filepath to the destination path and graph_optimization_level to onnxruntime.GraphOptimizationLevel.ORT_ENABLE_EXTENDED and load the model, then the optimized model will be saved to destination path. However, it is possible that not all QDQ are fused into QLinear.
Describe the issue
I have a qdqmodel which weight in operator actually be int 8
is there any setting in ort.seesion that can make qdq into qlinear mode? like this:
v
To reproduce
btw it is a resnet50 :D
Urgency
No response
Platform
Linux
OS Version
latest
ONNX Runtime Installation
Built from Source
ONNX Runtime Version or Commit ID
latest
ONNX Runtime API
Python
Architecture
X64
Execution Provider
Default CPU
Execution Provider Library Version
No response
The text was updated successfully, but these errors were encountered: