[DRAFT] Add smooth quant #13659

Closed
wants to merge 3 commits into from


mhs4670go
Contributor

This commit adds the smooth quant algorithm.

ONE-DCO-1.0-Signed-off-by: seongwoo <[email protected]>

@mhs4670go mhs4670go added the DRAFT A draft issue or PR for sharing one's current working status and discussion. label Aug 13, 2024
@mhs4670go
Contributor Author

@jinevening
Here is a brief overview of the changes.

  1. The pattern file saves act_scale instead of scale.

For smooth quant, the activation max is saved first and, for now, the "scale" for smooth quant is calculated at runtime.

  2. fm-equalize can receive each driver's path manually. The default path is unchanged: the same directory as fm-equalize.

This is for flexibility and testing (fm-equalize-value-py-test).

  3. The only pattern type is "ScaleOnly".

  4. The only passes are "FusePreScalePass" and "FusePostScalePass".
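
For reference, here is a minimal sketch of how a smoothing scale could be derived at runtime from a saved activation max. It is not taken from this PR. It assumes the commonly used smooth quant formula s_j = max|X_j|^alpha / max|W_j|^(1 - alpha), and every name in it (`smooth_scales`, `apply_smoothing`, `alpha`, `eps`) is hypothetical. It also shows why dividing the activation by the scale and multiplying the weight by it leaves the layer output unchanged.

```python
def smooth_scales(act_max, weight, alpha=0.5, eps=1e-5):
    """Per-input-channel scales s_j = act_max_j**alpha / w_max_j**(1 - alpha).

    act_max: per-channel max |activation| (assumed to be what act_scale stores).
    weight:  nested list with shape [in_channels][out_channels].
    alpha, eps: hypothetical parameters, not taken from the PR.
    """
    scales = []
    for j, a in enumerate(act_max):
        # Largest weight magnitude attached to input channel j.
        w_max = max(abs(w) for w in weight[j])
        # eps guards against a zero max on either side.
        s = (max(a, eps) ** alpha) / (max(w_max, eps) ** (1.0 - alpha))
        scales.append(s)
    return scales


def apply_smoothing(x, weight, scales):
    """Divide activations by s and multiply the matching weight rows by s."""
    x_s = [xj / s for xj, s in zip(x, scales)]
    w_s = [[w * s for w in row] for row, s in zip(weight, scales)]
    return x_s, w_s


def matmul_row(x, weight):
    """Multiply one activation row x [in] by weight [in][out]."""
    out_dim = len(weight[0])
    return [sum(x[j] * weight[j][k] for j in range(len(x))) for k in range(out_dim)]


# Toy data: channel 0 has a large activation and small weights.
x = [8.0, 0.5]
weight = [[0.25, -0.5], [2.0, 1.0]]
act_max = [abs(v) for v in x]  # stand-in for a calibrated activation max

scales = smooth_scales(act_max, weight)
x_s, w_s = apply_smoothing(x, weight, scales)

# The smoothed pair produces the same output as the original pair
# (up to floating-point rounding).
print(matmul_row(x, weight))
print(matmul_row(x_s, w_s))
```

With `alpha=0.5` the activation and weight ranges are balanced equally; storing only the activation max in the pattern file keeps that choice open until the scale is computed.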

Labels
DRAFT A draft issue or PR for sharing one's current working status and discussion.