DML EP EinSum make more generic to avoid EP fallback #21114

fdwr · 2024-06-20T07:20:47Z

Problem

Newer models using more novel equations (e.g. bhwc,hkc->bhwk in Segment Anything's encoder or bqc,bchw->bqhw) cause fallback from DML to CPU, yielding performance issues. The EP had some pattern matching to map more common equations to existing DML operators, but the number of permutations was prohibitive and could not catch them all.

Solution

So, ditch the static mapping, and instead handle any 1-input or 2-input cases via remapped strides and a mini-graph of elementwise multiplication & sum reduction (as if DML had a DML_OPERATOR_DOT_PRODUCT that took axes). A subset of mappings still exist for performance (GEMM, pure reduction, transpose...), but they are identified generally rather than via a pattern table. Also...

✅ Diagonals are supported now (e.g. iji->i).
✅ Removes any remaining DML-specific EinSum GTEST_SKIP statements.
✅ Handles any cases up to 8 unique labels (DML dimension limit is 8D).
⚠️ >= 3 inputs and arbitrary size inputs via ellipsis are not handled, but we have yet to come across a model.

ℹ️ Note that even with this change such that all nodes are assigned to DML (no CPU fallback), we still end up with multiple partitions because the ONNX EinSum shape inference logic by @peishenyan isn't in ORT yet ⏳.

onnxruntime/core/providers/dml/DmlExecutionProvider/src/TensorDesc.cpp

onnxruntime/core/providers/dml/OperatorAuthorHelper/OperatorHelper.cpp

onnxruntime/test/providers/cpu/math/einsum_test.cc

onnxruntime/core/providers/dml/DmlExecutionProvider/src/DmlCommon.cpp

onnxruntime/core/providers/dml/OperatorAuthorHelper/OperatorHelper.cpp

smk2007 · 2024-06-20T22:16:54Z

void TensorDesc::EnsureStridesExist()

nit: noexcept

Refers to: onnxruntime/core/providers/dml/DmlExecutionProvider/src/TensorDesc.cpp:367 in 17b9461. [](commit_id = 17b9461, deletion_comment = False)

fdwr · 2024-06-21T04:44:55Z

/azp run Big Models,orttraining-amd-gpu-ci-pipeline (Linux_Build_manylinux)

azure-pipelines · 2024-06-21T04:45:07Z

Azure Pipelines successfully started running 1 pipeline(s).

fdwr · 2024-06-21T18:45:37Z

The Linux errors and orttraining appear unrelated. Merging...

fdwr added 3 commits June 19, 2024 15:30

DML EP EinSum extend more generically

f3a4bee

Polish

46716f6

Polish

17b9461

fdwr requested review from PatriceVignola and smk2007 June 20, 2024 07:20

fdwr commented Jun 20, 2024

View reviewed changes

onnxruntime/core/providers/dml/DmlExecutionProvider/src/TensorDesc.cpp Show resolved Hide resolved

fdwr commented Jun 20, 2024

View reviewed changes

onnxruntime/core/providers/dml/DmlExecutionProvider/src/TensorDesc.cpp Show resolved Hide resolved

fdwr commented Jun 20, 2024

View reviewed changes

onnxruntime/core/providers/dml/OperatorAuthorHelper/OperatorHelper.cpp Show resolved Hide resolved

fdwr commented Jun 20, 2024

View reviewed changes

onnxruntime/test/providers/cpu/math/einsum_test.cc Show resolved Hide resolved

github-advanced-security bot found potential problems Jun 20, 2024

View reviewed changes

onnxruntime/core/providers/dml/DmlExecutionProvider/src/DmlCommon.cpp Dismissed Show resolved Hide resolved

PatriceVignola reviewed Jun 20, 2024

View reviewed changes

onnxruntime/core/providers/dml/OperatorAuthorHelper/OperatorHelper.cpp Outdated Show resolved Hide resolved

PatriceVignola previously approved these changes Jun 20, 2024

View reviewed changes

Polish and lint appeasement

b084e3e

fdwr dismissed PatriceVignola’s stale review via b084e3e June 20, 2024 21:23

fdwr added the ep:DML issues related to the DirectML execution provider label Jun 20, 2024

CR feedback, few more linter nag spots

5804f52

smk2007 previously approved these changes Jun 21, 2024

View reviewed changes

Sheil's feedback - add noexcepts

b3fca8d

fdwr dismissed smk2007’s stale review via b3fca8d June 21, 2024 02:48

smk2007 approved these changes Jun 21, 2024

View reviewed changes

fdwr merged commit ac21626 into main Jun 21, 2024
95 of 99 checks passed

fdwr deleted the users/dwayner/EinSum branch June 21, 2024 18:46

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DML EP EinSum make more generic to avoid EP fallback #21114

DML EP EinSum make more generic to avoid EP fallback #21114

fdwr commented Jun 20, 2024 •

edited

Loading

smk2007 commented Jun 20, 2024

fdwr commented Jun 21, 2024

azure-pipelines bot commented Jun 21, 2024

fdwr commented Jun 21, 2024

DML EP EinSum make more generic to avoid EP fallback #21114

DML EP EinSum make more generic to avoid EP fallback #21114

Conversation

fdwr commented Jun 20, 2024 • edited Loading

Problem

Solution

smk2007 commented Jun 20, 2024

fdwr commented Jun 21, 2024

azure-pipelines bot commented Jun 21, 2024

fdwr commented Jun 21, 2024

fdwr commented Jun 20, 2024 •

edited

Loading