[WIP] Stable Diffusion 3.x and Flux Optimization #22986

Draft: tianleiwu wants to merge 12 commits into main
Conversation

tianleiwu (Contributor) commented on Dec 2, 2024

Description

This work is in progress.

Optimize the ONNX pipeline for Stable Diffusion 3.x and Flux 1.0 models (fp32 or fp16).

  • Update optimize_pipeline script
  • Update benchmark script
  • Update documentation for Stable Diffusion 3.x and Flux 1.0 models
  • Add graph optimizations for the MMDiT model (the fused patterns are sketched after this list)
    • FastGelu fusion
    • RMSNorm fusion
    • MultiHeadAttention fusion
  • Add graph optimizations for Flux transformer models
    • MultiHeadAttention fusion
  • Add graph optimizations for Qwen2VL used in Flux
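
For reference, here is a minimal numpy sketch (illustrative only, not code from this PR) of the two patterns the FastGelu and RMSNorm fusions match: the tanh-based GELU approximation collapses into a single FastGelu node, and the RMSNorm computation maps onto ORT's SimplifiedLayerNormalization contrib op.

import numpy as np

def fast_gelu(x):
    # Tanh-based GELU approximation. The FastGelu fusion pattern-matches
    # this Pow/Mul/Add/Tanh subgraph and replaces it with one FastGelu node.
    return 0.5 * x * (1.0 + np.tanh(np.sqrt(2.0 / np.pi) * (x + 0.044715 * x**3)))

def rms_norm(x, weight, eps=1e-6):
    # RMSNorm: scale by the reciprocal root-mean-square over the last axis.
    # The RMSNorm fusion maps this onto SimplifiedLayerNormalization.
    return x / np.sqrt(np.mean(x * x, axis=-1, keepdims=True) + eps) * weight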

Example: optimizing the Flux 1.0 Schnell pipeline from fp32 to fp16:

python optimize_pipeline.py -i ./flux1_schnell_onnx/fp32 -o ./flux1_schnell_onnx/fp16 --float16

_optimize_sd_pipeline: Optimize flux1_schnell_onnx/fp32/transformer/model.onnx ...
               apply: Fused LayerNormalization: 115
               apply: Fused SimplifiedLayerNormalization: 152
               apply: Fused FastGelu: 76
               apply: Fused MultiHeadAttention: 57
         prune_graph: Removed 1406 nodes
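
The same optimization can also be scripted against onnxruntime.transformers.optimizer directly instead of going through optimize_pipeline.py. A minimal sketch; the "flux" model_type is an assumption based on this PR and may differ in the released package:

from onnxruntime.transformers.optimizer import optimize_model

# model_type="flux" is assumed here; check the shipped optimizer for the
# exact name. num_heads=0 and hidden_size=0 let the fusion passes infer
# the sizes from the graph.
model = optimize_model(
    "./flux1_schnell_onnx/fp32/transformer/model.onnx",
    model_type="flux",
    num_heads=0,
    hidden_size=0,
)
model.convert_float_to_float16(keep_io_types=True)  # mirrors --float16
model.save_model_to_file("./flux1_schnell_onnx/fp16/transformer/model.onnx")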

Motivation and Context

tianleiwu marked this pull request as draft on December 3, 2024
@@ -358,3 +361,122 @@
        self.nodes_to_add.append(fused_node)
        self.node_name_to_graph_name[fused_node.name] = self.this_graph_name
        return True

    def fuse_4(self, tanh_node, input_name_to_nodes: Dict, output_name_to_node: Dict) -> Optional[bool]:

Check notice (Code scanning / CodeQL): Explicit returns mixed with implicit (fall through) returns. Mixing implicit and explicit returns may indicate an error, as implicit returns always return None.
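
One way to resolve this note is to make every exit path in fuse_4 return explicitly instead of falling through. A standalone toy (hypothetical names, not the PR's code) showing why the implicit None is a hazard:

from typing import Optional

class FusionToy:
    def fuse_implicit(self, matched: bool) -> Optional[bool]:
        if matched:
            return True
        # Falls through: implicitly returns None, which a caller testing
        # truthiness cannot distinguish from an intentional False.

    def fuse_explicit(self, matched: bool) -> bool:
        if matched:
            return True
        return False  # every path returns explicitly

toy = FusionToy()
assert toy.fuse_implicit(False) is None   # the surprising implicit None
assert toy.fuse_explicit(False) is False  # unambiguous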
github-actions bot left a comment
You can commit the suggested changes from lintrunner.

Comment on lines +91 to +96
# if (options is None) or options.enable_skip_layer_norm:
# self.fuse_skip_simplified_layer_norm()
# self.fuse_skip_layer_norm()
# if (options is None) or options.enable_bias_skip_layer_norm:
# # Fuse SkipLayerNormalization and Add Bias before it.
# self.fuse_add_bias_skip_layer_norm()

Check notice (Code scanning / CodeQL): Commented-out code. This comment appears to contain commented-out code.
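
If this block is meant to be re-enabled for these models rather than deleted, it would presumably read as below; whether these skip-layer-norm fusions apply to MMDiT/Flux is this PR's call, so this is only a sketch of the uncommented form:

if (options is None) or options.enable_skip_layer_norm:
    self.fuse_skip_simplified_layer_norm()
    self.fuse_skip_layer_norm()
if (options is None) or options.enable_bias_skip_layer_norm:
    # Fuse SkipLayerNormalization and the Add bias before it.
    self.fuse_add_bias_skip_layer_norm()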