Is there any way to retrieve Quantization type and Quantization parameters using onnxruntime ? #19916
Labels
quantization
issues related to quantization
stale
issues that have not been addressed in a while; categorized by a bot
Describe the issue
Hello,
I'm trying to get quantization parameters from an input tensor such as the quantization type (Static Linear per tensor/ Static linear per channel/ dynamic) and the associated quantization parameter (scales & zero_points).
In tensorflow-lite, we are able to check if the model is quantized statically per-tensor or per-channel by simply doing:
I was wondering if there are any ways to do similar quantization parameters retrieving using onnxruntime.
Thank you for your help.
To reproduce
Not applicable
Urgency
This is really urgent since we are migrating from tensorflow-lite to onnxruntime, and this feature is kind of crucial for our implementation.
Platform
Linux
OS Version
Ubuntu 22.04
ONNX Runtime Installation
Built from Source
ONNX Runtime Version or Commit ID
1.15.1
ONNX Runtime API
C++
Architecture
X64
Execution Provider
Default CPU
Execution Provider Library Version
No response
The text was updated successfully, but these errors were encountered: