
RMS Normalization and Skip RMS Normalization fusion optimizations #1974

Merged
12 commits merged into main from rama/norm-fusion on Dec 14, 2024

Conversation

gramalingam
Collaborator

Implements RMS Normalization and Skip RMS Normalization fusion optimizations (for use of onnxruntime custom fused ops for these).
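For context, a minimal NumPy sketch of the computations the fused ops cover (function names and the epsilon default are illustrative, not taken from this PR):

    import numpy as np

    def rms_norm(x, weight, eps=1e-6):
        # Normalize x by its root-mean-square over the last axis, then scale
        # elementwise by the learned weight.
        rms = np.sqrt(np.mean(np.square(x), axis=-1, keepdims=True) + eps)
        return (x / rms) * weight

    def skip_rms_norm(x, skip, weight, eps=1e-6):
        # Residual (skip) addition followed by RMS normalization; the fusion
        # recognizes this add-then-normalize subgraph as one unit.
        return rms_norm(x + skip, weight, eps)

The rewriter looks for the decomposed form of these computations in the graph and replaces it with the corresponding fused op so that onnxruntime can dispatch to its fused kernels.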


codecov bot commented Dec 9, 2024

❌ 17 Tests Failed:

Tests completed | Failed | Passed | Skipped
16147           | 17     | 16130  | 3318
View the top 2 failed tests by shortest run time
::onnxscript.rewriter.onnxruntime.xformers._test_models
Stack Traces | 0s run time
No failure message available
tests.function_libs.torch_lib.ops_test.TestOutputConsistencyEagerCPU::test_output_match_opinfo__clamp_cpu_float16
Stack Traces | 0.001s run time
No failure message available
View the full list of 1 ❄️ flaky tests
tests.eager_mode_test.TestEagerModeArguments_0_reference_runtime::test_function_input_and_attribute_by_kwargs_out_of_order

Flake rate in main: 39.38% (Passed 12384 times, Failed 8045 times)

Stack Traces | 0.003s run time
.nox\test_torch_nightly\Lib\site-packages\onnx\reference\ops\_op.py:91: in run
    res = self._run(x, y)
.nox\test_torch_nightly\Lib\site-packages\onnx\reference\ops\_op.py:139: in _run
    res = (convert_from_ml_dtypes(res[0]),)
.nox\test_torch_nightly\Lib\site-packages\onnx\reference\custom_element_types.py:50: in convert_from_ml_dtypes
    return array.view(dtype=dtype)
E   ValueError: Changing the dtype of a 0d array is only supported if the itemsize is unchanged

The above exception was the direct cause of the following exception:
tests\eager_mode_test.py:115: in test_function_input_and_attribute_by_kwargs_out_of_order
    self.assertEqual(add_with_alpha(alpha=3.0, other=2.0, this=1.0), 7.0)
onnxscript\values.py:576: in __call__
    return evaluator.default().eval_function(self, args, kwargs)
onnxscript\evaluator.py:307: in eval_function
    result = function.function(*adapted_args, **adapted_kwargs)
tests\eager_mode_test.py:59: in add_with_alpha
    other = op.Mul(other, alpha)
onnxscript\onnx_opset\_impl\opset14.py:696: in Mul
    return op(*self._prepare_inputs(schema, A, B))
onnxscript\values.py:304: in __call__
    return evaluator.default().eval(schema, args, kwargs)
onnxscript\evaluator.py:194: in eval
    outputs = self._eval(schema, inputs, attributes, closure)
onnxscript\evaluator.py:524: in _eval
    result = session.run(None, session_run_input)
.nox\test_torch_nightly\Lib\site-packages\onnx\reference\reference_evaluator.py:599: in run
    outputs = node.run(*inputs, **linked_attributes)
.nox\test_torch_nightly\Lib\site-packages\onnx\reference\ops\_op.py:114: in run
    res = OpRunBinary.run(self, x, y)
.nox\test_torch_nightly\Lib\site-packages\onnx\reference\ops\_op.py:93: in run
    raise TypeError(
E   TypeError: Issues with types <class 'numpy.ndarray'>, <class 'numpy.ndarray'> (binary operator 'Mul').


@xadupre
Member

xadupre commented Dec 13, 2024

It looks good. At some point, we should have a page in the documentation that lists all the possible optimizations and how to trigger them. Even for us, this page should prove useful.

@gramalingam enabled auto-merge (squash) December 13, 2024 19:23

@github-advanced-security bot left a comment


lintrunner found more than 20 potential problems in the proposed changes. Check the Files changed tab for more details.

@gramalingam merged commit 0aed232 into main Dec 14, 2024
20 of 39 checks passed
@gramalingam deleted the rama/norm-fusion branch December 14, 2024 05:24
@justinchuby
Collaborator

> It looks good. At some point, we should have a page in the documentation that lists all the possible optimizations and how to trigger them. Even for us, this page should prove useful.

Sounds good. I think we should auto-generate this documentation from source.
