Add fusion patterns for conformer-transducer model #18461
Conversation
Force-pushed from c58247f to 98dea4d, then to 84600a0.
Please add a test case for the attention fusion. Otherwise, it will not be able to prevent regressions in the future.
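One way to write such a regression test is to assert on the op-type counts of the graph before and after optimization. A minimal stand-in sketch follows; the op lists here are hypothetical, and a real test would read the node types from the optimizer's output model with `onnx` instead:

```python
from collections import Counter

def op_counts(op_types):
    """Count occurrences of each op type in a graph's node list."""
    return Counter(op_types)

# Hypothetical op-type lists: before fusion, the raw attention pattern
# (MatMul/Softmax/...) is present; after fusion, a single Attention op
# should replace it. A real test would extract these from the ONNX graph.
unfused = ["MatMul", "Transpose", "Softmax", "MatMul", "Reshape"]
fused = ["Attention", "Reshape"]

before, after = op_counts(unfused), op_counts(fused)
assert after["Attention"] == 1   # fusion produced one Attention node
assert after["Softmax"] == 0     # raw attention pattern is gone
assert before["Softmax"] == 1    # unfused graph still contained it
```

Checking both directions (fused op present, raw pattern absent) keeps the test from passing if fusion silently stops matching.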
Force-pushed from 8fc8a31 to fd8d044, then to 7a730c2; later from 3fe5d00 to dda939e.
Review comments on onnxruntime/python/tools/transformers/fusion_conformer_attention.py (outdated, resolved).
```python
from typing import List

import numpy as np
import onnx
```
Check notice — Code scanning / CodeQL: Module is imported with 'import' and 'import from' (Note, test). Module 'onnxruntime.test.onnx' is imported with both 'import' and 'import from'.
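The notice refers to mixing the two import styles for one module. A small stdlib illustration of the flagged pattern and a consistent alternative:

```python
# Pattern CodeQL flags: the same module pulled in two ways.
import json
from json import dumps  # 'json' imported with both 'import' and 'import from'

# Consistent alternative: import once and reference attributes through it.
encoded = json.dumps({"fused": True})
assert encoded == '{"fused": true}'
```

Picking one style per module keeps name origins unambiguous and silences the notice.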
```python
class ConformerOnnxModel(BertOnnxModel):
    def __init__(self, model, num_heads, hidden_size):
        super().__init__(model, num_heads, hidden_size)
        self.attention_mask = AttentionMask(self)
```
Check warning — Code scanning / CodeQL: Overwriting attribute in super-class or sub-class (Warning): `BertOnnxModel`.
```python
    def __init__(self, model, num_heads, hidden_size):
        super().__init__(model, num_heads, hidden_size)
        self.attention_mask = AttentionMask(self)
        self.attention_fusion = FusionConformerAttention(self, self.hidden_size, self.num_heads, self.attention_mask)
```
Check warning — Code scanning / CodeQL: Overwriting attribute in super-class or sub-class (Warning): `BertOnnxModel`.
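This warning fires when a subclass `__init__` reassigns an attribute that the super-class constructor already set, making the base assignment dead work. A minimal illustration with hypothetical class names:

```python
class Base:
    def __init__(self):
        self.attention_mask = "base-mask"   # attribute set by the super-class

class Sub(Base):
    def __init__(self):
        super().__init__()                  # sets self.attention_mask ...
        self.attention_mask = "sub-mask"    # ... then overwrites it; CodeQL flags this

s = Sub()
assert s.attention_mask == "sub-mask"       # base assignment had no lasting effect
```

It is usually benign, but it can be avoided by letting the base class accept the value as a parameter or by documenting the intentional override.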
Force-pushed from d2ea8e5 to 93e9bda, then to df11324.
### Description

Add conformer-transducer model type to the optimizer. This PR adds pattern matches for the attention subgraph shown below.

Unfused attention:

![ct_unfused](https://github.com/microsoft/onnxruntime/assets/111780983/46c71ed8-67e0-4607-85b1-bcadba5a2956)

Fused attention:

![ct_fused](https://github.com/microsoft/onnxruntime/assets/111780983/fbb91c96-0d4b-4f0b-8674-1ae3b9b9a92e)