Reuse existing output node when replacing operator with a fusion #401

robertknight · 2024-11-09T06:32:28Z

When replacing a subgraph with a fused operator, reuse the output value node
from the subgraph instead of creating a new one. This preserves metadata such as
the name and shape associated with that value node. Also it simplifies the code
by removing the need to replace all references to the previous output node with
the new one.

This improves the runtime of the Whisper example using the whisper-base model by
~25% by fixing an issue where fused Transpose + MatMul operations did not get
used.

When replacing a subgraph with a fused operator, reuse the output value node from the subgraph instead of creating a new one. This preserves metadata such as the name and shape associated with that value node. Also it simplifies the code by removing the need to replace all references to the previous output node with the new one. This improves the runtime of the Whisper example using the whisper-base model by ~25% by fixing an issue where fused Transpose + MatMul operations did not get used.

Exposing `add_node` as a public method creates a hazard as it allows callers to add nodes to the graph without post-processing steps that other methods do (eg. `add_op`). Make this method private again and add a more specific `add_constant_node` alternative to handle the single use case for it outside the graph module.

robertknight force-pushed the optimize-reuse-output branch from b004dd9 to 04559ca Compare November 9, 2024 06:35

robertknight added 3 commits November 9, 2024 06:36

Correct outdated comment for GraphOptimizer::optimize

6a3abb4

robertknight force-pushed the optimize-reuse-output branch from 04559ca to 4937d83 Compare November 9, 2024 06:36

robertknight merged commit cf1ee74 into main Nov 9, 2024
2 checks passed

robertknight deleted the optimize-reuse-output branch November 9, 2024 06:41

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reuse existing output node when replacing operator with a fusion #401

Reuse existing output node when replacing operator with a fusion #401

robertknight commented Nov 9, 2024

Reuse existing output node when replacing operator with a fusion #401

Reuse existing output node when replacing operator with a fusion #401

Conversation

robertknight commented Nov 9, 2024