-
Notifications
You must be signed in to change notification settings - Fork 3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
additional gains from QDQ #11260
Comments
Comparing with QLinearOps, QDQ format is much more flexible and helps the ONNX quantization ecosystem. Here are some benefits examples:
|
Hello Yufeng, @yufenglee I am still curious to learn more about this topic.
I would appreciate your comments on this matter. In fact, I am planning to conduct comparative experiments to validate the advantages of the QDQ format. Could you please provide some advice and share your comments on this? |
Hello Yufeng, |
@Rickustc Did you find any solution for QDQ to Qoperator? I met the similar problem and I found another quant tool which generates Qoperator model. |
I have the same question. @Rickustc @JiliangNi Did you find the solution? |
Please refer to #21137 for the answer. |
Hi,
I am confused about what additional gains we can get from QDQ format compared with quantization with QLinearOps, can you share me some ideas? For example, if QDQ format is more general, which cases confirm it?
Thanks
The text was updated successfully, but these errors were encountered: