
[WebNN EP] Add cache for MLContexts in the WebNNBackend #22510

Merged: 3 commits merged into microsoft:main on Oct 30, 2024

Conversation

@egalli (Contributor) commented Oct 19, 2024

Description

This change adds a cache of `MLContext`s keyed by their options to the `WebNNBackend`. This makes it so that multiple `InferenceSession`s created with the same options will share the same context.
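For illustration only, here is a minimal sketch of the caching idea, assuming a map keyed by the serialized options; the names `contextCache` and `getCachedContext` are hypothetical, not the actual `WebNNBackend` code:

```js
// Hypothetical sketch of an MLContext cache keyed by context options.
// Not the actual WebNNBackend implementation.
const contextCache = new Map();

async function getCachedContext(options = {}) {
  // Serialize with sorted keys so that {a: 1, b: 2} and {b: 2, a: 1}
  // produce the same cache key (MLContextOptions are flat objects).
  const key = JSON.stringify(options, Object.keys(options).sort());
  let context = contextCache.get(key);
  if (context === undefined) {
    context = await navigator.ml.createContext(options);
    contextCache.set(key, context);
  }
  return context;
}
```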

Motivation and Context

Since `MLTensor`s are tied to `MLContext`s, developers can't easily share tensors between `InferenceSession`s (outside of manually creating an `MLContext` and specifying the `context` option). This leads to strange behaviors such as:

```js
const sessionA = await ort.InferenceSession.create(urlA, {
  executionProviders: ["webnn"],
  preferredOutputLocation: "ml-buffer",
});
const sessionB = await ort.InferenceSession.create(urlB, {
  executionProviders: ["webnn"],
});
const temp = await sessionA.run({/* arguments */});
// ERROR: Failed to execute 'dispatch' on 'MLContext': Invalid inputs:
// The context of MLGraph doesn't match the context of the MLTensor with name "input".
const result = await sessionB.run({ input: temp["output"] });
```

We encountered this behavior when updating the transformers.js version in the developer preview demos. microsoft/webnn-developer-preview#46
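For reference, the manual workaround mentioned above would look roughly like this. This is a sketch only: the `{ name: "webnn", context: ... }` execution provider option shape is an assumption based on the description, and `deviceType: "gpu"` is an arbitrary example value.

```js
// Sketch of the pre-existing workaround: create one MLContext up front
// and hand the same context to both sessions. The exact `context` EP
// option shape is assumed, not confirmed by this PR.
const mlContext = await navigator.ml.createContext({ deviceType: "gpu" });
const sessionA = await ort.InferenceSession.create(urlA, {
  executionProviders: [{ name: "webnn", context: mlContext }],
  preferredOutputLocation: "ml-buffer",
});
const sessionB = await ort.InferenceSession.create(urlB, {
  executionProviders: [{ name: "webnn", context: mlContext }],
});
```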

@egalli (Contributor, Author) commented Oct 22, 2024

@fdwr, @guschmue PTAL

@egalli (Contributor, Author) commented Oct 22, 2024

Also, @Honry PTAL

@Honry (Contributor) left a comment

LGTM, thanks!

@fdwr (Contributor) commented Oct 23, 2024

/azp run ONNX Runtime Web CI Pipeline,Windows GPU CI Pipeline,Linux Android Emulator QNN CI Pipeline

@fdwr (Contributor) commented Oct 23, 2024

/azp run Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline,Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline,Linux OpenVINO CI Pipeline,Linux QNN CI Pipeline,MacOS CI Pipeline,Windows ARM64 QNN CI Pipeline,Windows CPU CI Pipeline

@fdwr (Contributor) commented Oct 23, 2024

/azp run Windows GPU CUDA CI Pipeline,Windows GPU DML CI Pipeline,Windows GPU Doc Gen CI Pipeline

@fdwr (Contributor) commented Oct 23, 2024

/azp run Windows GPU TensorRT CI Pipeline,onnxruntime-binary-size-checks-ci-pipeline,orttraining-linux-ci-pipeline,orttraining-linux-gpu-ci-pipeline,orttraining-ortmodule-distributed,Windows x64 QNN CI Pipeline,Big Models

Azure Pipelines successfully started running 2 pipeline(s).

Azure Pipelines successfully started running 3 pipeline(s).

Azure Pipelines successfully started running 6 pipeline(s).

Azure Pipelines successfully started running 9 pipeline(s).

* Changed to a clearer and more robust way to compare `MLContextOption`s
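A sketch of what an order-independent comparison of `MLContextOptions` might look like, illustrating the idea behind the commit above; `sameContextOptions` is a hypothetical name, not the code that actually landed:

```js
// Hypothetical order-independent comparison of two MLContextOptions
// objects; the options are flat key/value pairs, so shallow equality
// over the union of keys suffices.
function sameContextOptions(a = {}, b = {}) {
  const keys = new Set([...Object.keys(a), ...Object.keys(b)]);
  return [...keys].every((key) => a[key] === b[key]);
}
```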
@fdwr (Contributor) left a comment

👍

@fdwr (Contributor) commented Oct 23, 2024

/azp run ONNX Runtime Web CI Pipeline,Windows GPU CI Pipeline,Linux Android Emulator QNN CI Pipeline

@fdwr (Contributor) commented Oct 23, 2024

/azp run Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline,Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline,Linux OpenVINO CI Pipeline,Linux QNN CI Pipeline,MacOS CI Pipeline,Windows ARM64 QNN CI Pipeline,Windows CPU CI Pipeline

@fdwr (Contributor) commented Oct 23, 2024

/azp run Windows GPU CUDA CI Pipeline,Windows GPU DML CI Pipeline,Windows GPU Doc Gen CI Pipeline

@fdwr (Contributor) commented Oct 23, 2024

/azp run Windows GPU TensorRT CI Pipeline,onnxruntime-binary-size-checks-ci-pipeline,orttraining-linux-ci-pipeline,orttraining-linux-gpu-ci-pipeline,orttraining-ortmodule-distributed,Windows x64 QNN CI Pipeline,Big Models

Azure Pipelines successfully started running 2 pipeline(s).

Azure Pipelines successfully started running 3 pipeline(s).

Azure Pipelines successfully started running 4 pipeline(s).

Azure Pipelines successfully started running 9 pipeline(s).

@fdwr (Contributor) commented Oct 24, 2024

Enrico, seems like a legitimate ORT Web CI error (though I'm not seeing an obvious link from your change to a WebGPU failure):

    [webgpu]GroupQueryAttention - GroupQueryAttention PackedQKV 15
      × T[0]
        Chrome Headless 130.0.0.0 (Windows 10)
      AssertionError: tensor data should match: expected false to be true

https://dev.azure.com/onnxruntime/onnxruntime/_build/results?buildId=1534194&view=logs&j=4cf9212c-8936-5e77-cbdb-290c1c5567eb&t=f6ca5817-a7c2-5469-2496-2df89aa89689

(update) Never mind - I see the failure in #22181 too.

@fdwr fdwr requested a review from guschmue October 24, 2024 03:52
@guschmue (Contributor) commented

/azp run ONNX Runtime Web CI Pipeline,ONNX Runtime Web CI Pipeline (Build_web_Release build_onnxruntime_web)

Azure Pipelines successfully started running 1 pipeline(s).

@guschmue guschmue added the ep:WebNN WebNN execution provider label Oct 30, 2024
@fdwr (Contributor) commented Oct 30, 2024

/azp run ONNX Runtime Web CI Pipeline, ONNX Runtime Web CI Pipeline (Build_web_Release build_onnxruntime_web)

Azure Pipelines successfully started running 1 pipeline(s).

@fdwr fdwr merged commit df236c7 into microsoft:main Oct 30, 2024
72 checks passed
ishwar-raut1 pushed a commit to ishwar-raut1/onnxruntime that referenced this pull request Nov 19, 2024
ankitm3k pushed a commit to intel/onnxruntime that referenced this pull request Dec 11, 2024

ankitm3k pushed a commit to intel/onnxruntime that referenced this pull request Dec 11, 2024

ankitm3k pushed a commit to intel/onnxruntime that referenced this pull request Dec 11, 2024