[js/webgpu] Provide a vectorized algorithm for GroupedConv #18884

qjia7 · 2023-12-20T07:27:53Z

Description

This PR provides a vectorized algorithm for NHWC GroupedConv to improve performance.

The aggregate time of GroupedConv in mobilenetv2-12 becomes ~1ms from ~4ms on Intel Alder Lake machine. About 20% improvement for the whole model.

qjia7 · 2023-12-20T07:29:58Z

@fs-eire @guschmue @satyajandhyala Please take a look, thanks.

guschmue · 2023-12-22T16:38:31Z

/azp run ONNX Runtime Web CI Pipeline

guschmue · 2023-12-22T16:38:37Z

/azp run Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline,Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline,Linux OpenVINO CI Pipeline,Linux QNN CI Pipeline,MacOS CI Pipeline,Windows ARM64 QNN CI Pipeline,Windows CPU CI Pipeline

azure-pipelines · 2023-12-22T16:38:42Z

Azure Pipelines successfully started running 1 pipeline(s).

guschmue · 2023-12-22T16:38:45Z

/azp run Windows GPU CI Pipeline,Windows GPU TensorRT CI Pipeline,onnxruntime-binary-size-checks-ci-pipeline,orttraining-linux-ci-pipeline,orttraining-linux-gpu-ci-pipeline,orttraining-ortmodule-distributed,Windows x64 QNN CI Pipeline

azure-pipelines · 2023-12-22T16:38:48Z

Azure Pipelines could not run because the pipeline triggers exclude this branch/path.

azure-pipelines · 2023-12-22T16:38:59Z

Azure Pipelines successfully started running 1 pipeline(s).

js/web/lib/wasm/jsep/webgpu/ops/conv.ts

fs-eire · 2024-01-04T02:33:18Z

/azp run Windows ARM64 QNN CI Pipeline,Windows x64 QNN CI Pipeline,Windows CPU CI Pipeline,Windows GPU CI Pipeline,Windows GPU TensorRT CI Pipeline,ONNX Runtime Web CI Pipeline,Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline,Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline

fs-eire · 2024-01-04T02:33:20Z

/azp run Linux OpenVINO CI Pipeline,Linux QNN CI Pipeline,MacOS CI Pipeline,orttraining-amd-gpu-ci-pipeline,orttraining-linux-ci-pipeline,orttraining-linux-gpu-ci-pipeline,orttraining-ortmodule-distributed,onnxruntime-python-checks-ci-pipeline,onnxruntime-binary-size-checks-ci-pipeline,Android CI Pipeline

fs-eire · 2024-01-04T02:33:21Z

/azp run iOS CI Pipeline,ONNX Runtime React Native CI Pipeline

azure-pipelines · 2024-01-04T02:33:33Z

Azure Pipelines successfully started running 1 pipeline(s).

azure-pipelines · 2024-01-04T02:33:33Z

Azure Pipelines successfully started running 1 pipeline(s).

azure-pipelines · 2024-01-04T02:33:33Z

Azure Pipelines successfully started running 1 pipeline(s).

gyagp · 2024-01-04T02:37:57Z

@fs-eire @guschmue @satyajandhyala any more comments on this?

gyagp · 2024-01-10T00:23:55Z

@guschmue @fs-eire Do you think these unsuccessful checks relevant?

fs-eire · 2024-01-10T01:31:07Z

@guschmue @fs-eire Do you think these unsuccessful checks relevant?

It looks like the unittest are failing:

[webgpu]Conv - conv - vectorize group - B
[webgpu]Conv - conv - vectorize group - D

fs-eire · 2024-01-11T00:08:30Z

I am running the unit tests on my local machine to check if CI reports false error.

fs-eire · 2024-01-11T00:12:38Z

I am running the unit tests on my local machine to check if CI reports false error.

Passed on my local as well. It should be a false error. Let me merge it and watch the main branch.

qjia7 · 2024-01-11T03:28:34Z

@guschmue @fs-eire Do you think these unsuccessful checks relevant?

It looks like the unittest are failing:

[webgpu]Conv - conv - vectorize group - B
[webgpu]Conv - conv - vectorize group - D

I locally tried NV and Intel machines. All cases pass on both of them on latest main. I can't reproduce those failures. Any idea for this?

…icrosoft#18884)" This reverts commit fd6bab4 due to below cases failure on bots [webgpu]Conv - conv - vectorize group - B [webgpu]Conv - conv - vectorize group - D

…#18884) ### Description This PR provides a vectorized algorithm for NHWC GroupedConv to improve performance. The aggregate time of GroupedConv in mobilenetv2-12 becomes ~1ms from ~4ms on Intel Alder Lake machine. About 20% improvement for the whole model.

qjia7 added 4 commits December 20, 2023 13:52

[js/webgpu] Provide a fast path for groupconv when strides=1

76b17e9

add uniform support

4696b0d

nits

8b8b419

use snake case naming

fb96081

guschmue previously approved these changes Dec 22, 2023

View reviewed changes

satyajandhyala reviewed Dec 22, 2023

View reviewed changes

js/web/lib/wasm/jsep/webgpu/ops/conv.ts Show resolved Hide resolved

add test cases

8a0d700

qjia7 dismissed guschmue’s stale review via 8a0d700 December 25, 2023 02:39

qjia7 requested review from satyajandhyala and guschmue December 25, 2023 02:43

satyajandhyala approved these changes Dec 26, 2023

View reviewed changes

fs-eire approved these changes Jan 4, 2024

View reviewed changes

fs-eire merged commit fd6bab4 into microsoft:main Jan 11, 2024
40 of 52 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[js/webgpu] Provide a vectorized algorithm for GroupedConv #18884

[js/webgpu] Provide a vectorized algorithm for GroupedConv #18884

qjia7 commented Dec 20, 2023

qjia7 commented Dec 20, 2023

guschmue commented Dec 22, 2023

guschmue commented Dec 22, 2023

azure-pipelines bot commented Dec 22, 2023

guschmue commented Dec 22, 2023

azure-pipelines bot commented Dec 22, 2023

azure-pipelines bot commented Dec 22, 2023

fs-eire commented Jan 4, 2024

fs-eire commented Jan 4, 2024

fs-eire commented Jan 4, 2024

azure-pipelines bot commented Jan 4, 2024

azure-pipelines bot commented Jan 4, 2024

azure-pipelines bot commented Jan 4, 2024

gyagp commented Jan 4, 2024

gyagp commented Jan 10, 2024

fs-eire commented Jan 10, 2024

fs-eire commented Jan 11, 2024

fs-eire commented Jan 11, 2024

qjia7 commented Jan 11, 2024

[js/webgpu] Provide a vectorized algorithm for GroupedConv #18884

[js/webgpu] Provide a vectorized algorithm for GroupedConv #18884

Conversation

qjia7 commented Dec 20, 2023

Description

qjia7 commented Dec 20, 2023

guschmue commented Dec 22, 2023

guschmue commented Dec 22, 2023

azure-pipelines bot commented Dec 22, 2023

guschmue commented Dec 22, 2023

azure-pipelines bot commented Dec 22, 2023

azure-pipelines bot commented Dec 22, 2023

fs-eire commented Jan 4, 2024

fs-eire commented Jan 4, 2024

fs-eire commented Jan 4, 2024

azure-pipelines bot commented Jan 4, 2024

azure-pipelines bot commented Jan 4, 2024

azure-pipelines bot commented Jan 4, 2024

gyagp commented Jan 4, 2024

gyagp commented Jan 10, 2024

fs-eire commented Jan 10, 2024

fs-eire commented Jan 11, 2024

fs-eire commented Jan 11, 2024

qjia7 commented Jan 11, 2024