[DmlEP] BeamSearch node is not supported for DmlEP? #18805

trajepl · 2023-12-13T11:04:01Z

Describe the issue

Could not find an implementation for BeamSearch node even if I enable onnxruntime extensions for Dml EP.

To reproduce

Install DmlEP, run whisper example https://github.com/microsoft/Olive/tree/main/examples/whisper.

Remembered to change the ep to

Urgency

No response

Platform

Windows

OS Version

Win11

ONNX Runtime Installation

Released Package

ONNX Runtime Version or Commit ID

1.16.3

ONNX Runtime API

Python

Architecture

X64

Execution Provider

DirectML

Execution Provider Library Version

No response

trajepl · 2023-12-13T13:15:55Z

fdwr · 2023-12-15T03:22:10Z

com.microsoft.BeamSearch isn't supported by the DML Execution Provider (see the DML EP kernel list), but you should still get fallback to the CPU for this operator though. So why that's not happening is the question. I suspect @PatriceVignola would be more familiar with this area (noting you're using Whisper and Olive), since he's been trying a number of transformer models lately.

trajepl · 2023-12-15T03:38:04Z

Thanks for your answer. So sounds like it is hard to [1]run generation task in DmL-gpu if we want to insert beam search node into onnx graph. The expected behavior should fallback to CPU but the performance will be impacted.

I am the Olive contributor actually. Just want to double check if [1] is supported in Dml EP as current example in Olive for Eml use optimum for generation tasks.

trajepl · 2023-12-15T03:39:23Z

As for the failure of fallback, I doubt it is my local env issue. Switched to another linux machine, it worked. So I will close this issue.
Thanks! fdwr

fdwr · 2023-12-15T23:44:49Z

if we want to insert beam search node into onnx graph

Reading com.microsoft.BeamSearch, it doesn't sound like a very GPU-amenable operator, with subgraphs and variable size ngrams and tokens. Hopefully it's an operator than just occurs once or a few times in the graph, which avoids CPU<->GPU transfer stalls.

As for the failure of fallback ... Switched to another linux machine, it worked.

🤔 DML doesn't run on raw Linux. Did you mean using the DML EP atop WSL?

Just want to double check if [1] is supported in Dml EP as current example in Olive

Sounds like the current answer is no 😉, not until the bug is figured out. DML itself has no awareness of BeamSearch, and so it might also be a graph transformer issue outside the EP. I'm sure Pat will get to this matter :).

trajepl · 2023-12-18T04:29:37Z

Thanks! 👍 Yes it seems the graph transformer issue and I will reach our Pat for help if needed.

🤔 DML doesn't run on raw Linux. Did you mean using the DML EP atop WSL?

Oh, I mean the fallback logics not limited to the EP.

github-actions bot added ep:DML issues related to the DirectML execution provider platform:windows issues related to the Windows platform labels Dec 13, 2023

trajepl closed this as completed Dec 15, 2023

trajepl mentioned this issue Dec 15, 2023

Whisper does not converted using onnxruntime-directml microsoft/Olive#813

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[DmlEP] BeamSearch node is not supported for DmlEP? #18805

[DmlEP] BeamSearch node is not supported for DmlEP? #18805

trajepl commented Dec 13, 2023

trajepl commented Dec 13, 2023

fdwr commented Dec 15, 2023

trajepl commented Dec 15, 2023

trajepl commented Dec 15, 2023

fdwr commented Dec 15, 2023 •

edited

Loading

trajepl commented Dec 18, 2023

[DmlEP] BeamSearch node is not supported for DmlEP? #18805

[DmlEP] BeamSearch node is not supported for DmlEP? #18805

Comments

trajepl commented Dec 13, 2023

Describe the issue

To reproduce

Urgency

Platform

OS Version

ONNX Runtime Installation

ONNX Runtime Version or Commit ID

ONNX Runtime API

Architecture

Execution Provider

Execution Provider Library Version

trajepl commented Dec 13, 2023

fdwr commented Dec 15, 2023

trajepl commented Dec 15, 2023

trajepl commented Dec 15, 2023

fdwr commented Dec 15, 2023 • edited Loading

trajepl commented Dec 18, 2023

fdwr commented Dec 15, 2023 •

edited

Loading