If Branch Constant Folding #18105

yuslepukhin · 2023-10-26T00:16:33Z

Description

When and if If condition proves to be a constant value, inline the corresponding subgraph yielding to more constant folding and optimization.

Motivation and Context

Newly converted models feature lots of nested If nodes that can be inlined and collapsed.

In particular, for the sample models we are gaining on TorchScript exported models.
For HF Mobile Bert Dynamo runtime went down from 0.069 -> 0.046. In total, AOT inlining + If constant folding
yields improvement of about 50% 0.102 -> 0.046. Brining us very close to TorchScript exported models.

HF Bart Dynamo further improves 0.668 -> 0.45. AOT + If constant folding improves 0.98 -> 0.45

Earlier the size of
HF Mobile Bert 161Mb+, now 98Mb
HF Bart Dynamo pre-optimized model was about 1.2Gb. It is now 710MB

Complete Finalize

Currently, there is a problem QDQ pairing transformer that ensures pairing between QDQ pairs including within the graph.

…lding

Change the type of the templates map so memory is partially released when not all of the functions are removed. Change the inline naming schema with slashes.

…lding

Add tests Rename the function. Remove functions directly from the partitioner.

…lding

snnn · 2023-11-08T23:28:36Z

/azp run Windows GPU CI Pipeline, Windows GPU TensorRT CI Pipeline, orttraining-ortmodule-distributed

azure-pipelines · 2023-11-08T23:28:54Z

Azure Pipelines successfully started running 3 pipeline(s).

when GraphViewer is not able to topologically sort new nodes due to lack of edges. We generate edges and make sure that nodes with subgraphs get implicit inputs setup.

onnxruntime/core/graph/graph.cc

onnxruntime/core/optimizer/constant_folding.cc

onnxruntime/core/graph/graph.cc

onnxruntime/test/testdata/transform/transform_nested_ifs_toplogical_sorted_nodes.txt

onnxruntime/test/testdata/transform/transform_nested_ifs_toplogical_sorted_nodes.py

onnxruntime/core/graph/graph.cc

pranavsharma

Some minor comments.

onnxruntime/core/optimizer/constant_folding.cc

include/onnxruntime/core/graph/graph.h

onnxruntime/core/graph/graph.cc

…args (#18462) ### Description Truncate traling non-existing arguments. Make sure we do not skip on the non-existing arguments in the middle, because shape inferece relies on their proper position. This also affects the argument position in the Edges that must be properly rebuilt each time If node branch is inlined. Make sure that when we rename Defs in subgraphs, new renamed defs are created in those subgraphs instead of pointing to outer scope defs. Add unit test. ### Motivation and Context This is a follow up for #18105 Currently, the non-trailing arguments are simply ignored and the edges are created with potentially incorrect positions.

### Description When and if `If` condition proves to be a constant value, inline the corresponding subgraph yielding to more constant folding and optimization. ### Motivation and Context Newly converted models feature lots of nested `If` nodes that can be inlined and collapsed. In particular, for the sample models we are gaining on TorchScript exported models. For `HF Mobile Bert Dynamo` runtime went down from 0.069 -> 0.046. In total, AOT inlining + `If` constant folding yields improvement of about 50% 0.102 -> 0.046. Brining us very close to TorchScript exported models. `HF Bart Dynamo` further improves 0.668 -> 0.45. AOT + `If` constant folding improves 0.98 -> 0.45 Earlier the size of HF Mobile Bert **161Mb+**, now **98Mb** HF Bart Dynamo pre-optimized model was about **1.2Gb**. It is now **710MB** ![image](https://github.com/microsoft/onnxruntime/assets/11303988/1491a247-d371-4e66-85a3-2aeb702e8ca0)

…args (microsoft#18462) ### Description Truncate traling non-existing arguments. Make sure we do not skip on the non-existing arguments in the middle, because shape inferece relies on their proper position. This also affects the argument position in the Edges that must be properly rebuilt each time If node branch is inlined. Make sure that when we rename Defs in subgraphs, new renamed defs are created in those subgraphs instead of pointing to outer scope defs. Add unit test. ### Motivation and Context This is a follow up for microsoft#18105 Currently, the non-trailing arguments are simply ignored and the edges are created with potentially incorrect positions.

yuslepukhin added 30 commits October 9, 2023 15:06

Add InlineSubgraph

0de61fa

Complete Finalize

AOT Inlining

9bfaabf

Add Span() based defs back

5637ad4

Lint

9da7a1e

Implement If node ConstantFolding.

c3419ad

Currently, there is a problem QDQ pairing transformer that ensures pairing between QDQ pairs including within the graph.

Merge branch 'main' into yuslepukhin/aot_inline

3da463d

Address some review comments

bf3a3b2

Revert InlineFunction

18df323

Remove local functions

5a19dcf

Common graph_viewer

7b3efdc

Merge branch 'main' into yuslepukhin/aot_inline

2d56e5a

Merge branch 'yuslepukhin/aot_inline' into yuslepukhin/if_constant_fo…

fd8be36

…lding

Remove spurious test, lint.

c5ee067

Merge branch 'yuslepukhin/aot_inline' into yuslepukhin/if_constant_fo…

7c1f9a6

…lding

Prevent If Constant Fold in the QDQ test

4845104

If Constant Folding non-recursive

8dc17a3

HF Bart works

c81c1bf

Bug fixes

cce536f

Merge branch 'main' into yuslepukhin/aot_inline

39439bb

Do not remove non-inlined function definitins.

d91740c

Merge branch 'yuslepukhin/aot_inline' into yuslepukhin/if_constant_fo…

bcb402f

…lding

Add topo order. HF Bert works.

04c6fd6

Compute function id from domain, OpType of the node.

c3c20b4

Change the type of the templates map so memory is partially released when not all of the functions are removed. Change the inline naming schema with slashes.

Compute function id from domain, OpType of the node.

1108cc1

Change the type of the templates map so memory is partially released when not all of the functions are removed. Change the inline naming schema with slashes.

Revert back to underscore

c945f96

Merge branch 'yuslepukhin/aot_inline' into yuslepukhin/if_constant_fo…

6a8897d

…lding

Add kill switch for AOT

2db2e59

Add tests Rename the function. Remove functions directly from the partitioner.

Merge branch 'yuslepukhin/aot_inline' into yuslepukhin/if_constant_fo…

c86919b

…lding

build error

d9106d1

Merge branch 'yuslepukhin/aot_inline' into yuslepukhin/if_constant_fo…

4468fc1

…lding

Remove profiling test

a0b8899

gramalingam previously approved these changes Nov 8, 2023

View reviewed changes

edgchen1 previously approved these changes Nov 8, 2023

View reviewed changes

Merge branch 'main' into yuslepukhin/if_constant_folding

f0e84ac

yuslepukhin added 2 commits November 9, 2023 17:33

Immediate consequetive nested Ifs inlining causes an issue

a17b3e0

when GraphViewer is not able to topologically sort new nodes due to lack of edges. We generate edges and make sure that nodes with subgraphs get implicit inputs setup.

Merge branch 'main' into yuslepukhin/if_constant_folding

60d99fb

yuslepukhin dismissed stale reviews from edgchen1 and gramalingam via 60d99fb November 10, 2023 01:36

Adjust map type

f0480cb

edgchen1 reviewed Nov 13, 2023

View reviewed changes

yuslepukhin added 2 commits November 13, 2023 13:22

Merge branch 'main' into yuslepukhin/if_constant_folding

0ea3e23

Address review comments

d955fbf

github-advanced-security bot found potential problems Nov 13, 2023

View reviewed changes

Lint

94274a9

github-advanced-security bot found potential problems Nov 13, 2023

View reviewed changes

lint 2

f38afc3

gramalingam reviewed Nov 13, 2023

View reviewed changes

onnxruntime/core/graph/graph.cc Outdated Show resolved Hide resolved

map_defs usage

7805d47

gramalingam approved these changes Nov 14, 2023

View reviewed changes

pranavsharma reviewed Nov 14, 2023

View reviewed changes

pranavsharma approved these changes Nov 14, 2023

View reviewed changes

yuslepukhin merged commit f19c673 into main Nov 14, 2023
84 of 89 checks passed

yuslepukhin deleted the yuslepukhin/if_constant_folding branch November 14, 2023 01:33

yuslepukhin mentioned this pull request Nov 16, 2023

Create edges with arg positons correctly accounting for non-existing args #18462

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

If Branch Constant Folding #18105

If Branch Constant Folding #18105

yuslepukhin commented Oct 26, 2023 •

edited

Loading

snnn commented Nov 8, 2023

azure-pipelines bot commented Nov 8, 2023

pranavsharma left a comment

If Branch Constant Folding #18105

If Branch Constant Folding #18105

Conversation

yuslepukhin commented Oct 26, 2023 • edited Loading

Description

Motivation and Context

snnn commented Nov 8, 2023

azure-pipelines bot commented Nov 8, 2023

pranavsharma left a comment

Choose a reason for hiding this comment

yuslepukhin commented Oct 26, 2023 •

edited

Loading