Add SpaceToDepth and DepthToSpace CUDA NHWC Ops #19646

mtavenrath · 2024-02-26T10:06:55Z

Description

Adding CUDA NHWC support for SpaceToDepth and DepthToSpace
Add a new test which verifies that swizzling SpaceToDepth swizzling for the H axis is correct.
If CUDA NHWC is enabled, run all tests on the CUDA EP with NHWC as well.

Motivation and Context

Adding more NHWC operations to avoid layout transformations when using the CUDA EP for more efficiency.

tianleiwu · 2024-02-26T15:50:09Z

/azp run Windows ARM64 QNN CI Pipeline,Windows x64 QNN CI Pipeline,Windows CPU CI Pipeline,Windows GPU CI Pipeline,Windows GPU TensorRT CI Pipeline,ONNX Runtime Web CI Pipeline,Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline,Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline

tianleiwu · 2024-02-26T15:50:10Z

/azp run Linux OpenVINO CI Pipeline,Linux QNN CI Pipeline,MacOS CI Pipeline,orttraining-amd-gpu-ci-pipeline,orttraining-linux-ci-pipeline,orttraining-linux-gpu-ci-pipeline,orttraining-ortmodule-distributed,onnxruntime-python-checks-ci-pipeline,onnxruntime-binary-size-checks-ci-pipeline,Android CI Pipeline

tianleiwu · 2024-02-26T15:50:11Z

/azp run iOS CI Pipeline,ONNX Runtime React Native CI Pipeline

azure-pipelines · 2024-02-26T15:50:26Z

Azure Pipelines successfully started running 2 pipeline(s).

azure-pipelines · 2024-02-26T15:50:47Z

Azure Pipelines successfully started running 9 pipeline(s).

azure-pipelines · 2024-02-26T15:50:48Z

Azure Pipelines successfully started running 10 pipeline(s).

onnxruntime/core/providers/cuda/tensor/space_depth_ops.cc

tianleiwu · 2024-03-01T18:49:49Z

/azp run Windows ARM64 QNN CI Pipeline,Windows x64 QNN CI Pipeline,Windows CPU CI Pipeline,Windows GPU CI Pipeline,Windows GPU TensorRT CI Pipeline,ONNX Runtime Web CI Pipeline,Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline,Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline

tianleiwu · 2024-03-01T18:49:50Z

/azp run Linux OpenVINO CI Pipeline,Linux QNN CI Pipeline,MacOS CI Pipeline,orttraining-amd-gpu-ci-pipeline,orttraining-linux-ci-pipeline,orttraining-linux-gpu-ci-pipeline,orttraining-ortmodule-distributed,onnxruntime-binary-size-checks-ci-pipeline,Big Models,Android CI Pipeline

tianleiwu · 2024-03-01T18:49:51Z

/azp run iOS CI Pipeline,ONNX Runtime React Native CI Pipeline

azure-pipelines · 2024-03-01T18:50:06Z

Azure Pipelines successfully started running 2 pipeline(s).

azure-pipelines · 2024-03-01T18:50:25Z

Azure Pipelines successfully started running 10 pipeline(s).

azure-pipelines · 2024-03-01T18:50:30Z

Azure Pipelines successfully started running 10 pipeline(s).

tianleiwu · 2024-03-04T17:46:44Z

/azp run Windows ARM64 QNN CI Pipeline,Windows x64 QNN CI Pipeline,Windows CPU CI Pipeline,Windows GPU CI Pipeline,Windows GPU TensorRT CI Pipeline,ONNX Runtime Web CI Pipeline,Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline,Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline

tianleiwu · 2024-03-04T17:46:44Z

/azp run Linux OpenVINO CI Pipeline,Linux QNN CI Pipeline,MacOS CI Pipeline,orttraining-amd-gpu-ci-pipeline,orttraining-linux-ci-pipeline,orttraining-linux-gpu-ci-pipeline,orttraining-ortmodule-distributed,onnxruntime-binary-size-checks-ci-pipeline,Big Models,Android CI Pipeline

tianleiwu · 2024-03-04T17:46:45Z

/azp run iOS CI Pipeline,ONNX Runtime React Native CI Pipeline

azure-pipelines · 2024-03-04T17:47:02Z

Azure Pipelines successfully started running 2 pipeline(s).

azure-pipelines · 2024-03-04T17:47:22Z

Azure Pipelines successfully started running 10 pipeline(s).

azure-pipelines · 2024-03-04T17:47:26Z

Azure Pipelines successfully started running 10 pipeline(s).

tianleiwu · 2024-03-04T20:51:57Z

/azp run Windows ARM64 QNN CI Pipeline,Windows x64 QNN CI Pipeline,Windows CPU CI Pipeline,Windows GPU CI Pipeline,Windows GPU TensorRT CI Pipeline,ONNX Runtime Web CI Pipeline,Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline,Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline

tianleiwu · 2024-03-04T20:51:58Z

/azp run Linux OpenVINO CI Pipeline,Linux QNN CI Pipeline,MacOS CI Pipeline,orttraining-amd-gpu-ci-pipeline,orttraining-linux-ci-pipeline,orttraining-linux-gpu-ci-pipeline,orttraining-ortmodule-distributed,onnxruntime-binary-size-checks-ci-pipeline,Big Models,Android CI Pipeline

tianleiwu · 2024-03-04T20:51:59Z

/azp run iOS CI Pipeline,ONNX Runtime React Native CI Pipeline

azure-pipelines · 2024-03-04T20:52:13Z

Azure Pipelines successfully started running 2 pipeline(s).

azure-pipelines · 2024-03-04T20:52:32Z

Azure Pipelines successfully started running 10 pipeline(s).

azure-pipelines · 2024-03-04T20:52:37Z

Azure Pipelines successfully started running 10 pipeline(s).

… cuda_nhwc_kernels

… failing tests.

…linter

tianleiwu · 2024-03-05T21:11:01Z

/azp run Windows ARM64 QNN CI Pipeline,Windows x64 QNN CI Pipeline,Windows CPU CI Pipeline,Windows GPU CI Pipeline,Windows GPU TensorRT CI Pipeline,ONNX Runtime Web CI Pipeline,Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline,Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline

tianleiwu · 2024-03-05T21:11:01Z

/azp run Linux OpenVINO CI Pipeline,Linux QNN CI Pipeline,MacOS CI Pipeline,orttraining-amd-gpu-ci-pipeline,orttraining-linux-ci-pipeline,orttraining-linux-gpu-ci-pipeline,orttraining-ortmodule-distributed,onnxruntime-binary-size-checks-ci-pipeline,Big Models,Android CI Pipeline

tianleiwu · 2024-03-05T21:11:02Z

/azp run iOS CI Pipeline,ONNX Runtime React Native CI Pipeline

azure-pipelines · 2024-03-05T21:11:18Z

Azure Pipelines successfully started running 2 pipeline(s).

azure-pipelines · 2024-03-05T21:11:37Z

Azure Pipelines successfully started running 10 pipeline(s).

azure-pipelines · 2024-03-05T21:11:42Z

Azure Pipelines successfully started running 10 pipeline(s).

onnxruntime/core/providers/cuda/tensor/space_depth_ops.cc

tianleiwu · 2024-03-05T23:02:53Z

/azp run Windows ARM64 QNN CI Pipeline,Windows x64 QNN CI Pipeline,Windows CPU CI Pipeline,Windows GPU CI Pipeline,Windows GPU TensorRT CI Pipeline,ONNX Runtime Web CI Pipeline,Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline,Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline

tianleiwu · 2024-03-05T23:02:54Z

/azp run Linux OpenVINO CI Pipeline,Linux QNN CI Pipeline,MacOS CI Pipeline,orttraining-amd-gpu-ci-pipeline,orttraining-linux-ci-pipeline,orttraining-linux-gpu-ci-pipeline,orttraining-ortmodule-distributed,onnxruntime-binary-size-checks-ci-pipeline,Big Models,Android CI Pipeline

tianleiwu · 2024-03-05T23:02:54Z

/azp run iOS CI Pipeline,ONNX Runtime React Native CI Pipeline

azure-pipelines · 2024-03-05T23:03:09Z

Azure Pipelines successfully started running 2 pipeline(s).

azure-pipelines · 2024-03-05T23:03:31Z

Azure Pipelines successfully started running 10 pipeline(s).

tianleiwu · 2024-03-06T17:42:46Z

/azp run ONNX Runtime Web CI Pipeline,Linux OpenVINO CI Pipeline,Linux QNN CI Pipeline,MacOS CI Pipeline,orttraining-amd-gpu-ci-pipeline,orttraining-linux-ci-pipeline,orttraining-linux-gpu-ci-pipeline,orttraining-ortmodule-distributed,onnxruntime-binary-size-checks-ci-pipeline,Big Models

tianleiwu · 2024-03-06T17:42:46Z

/azp run Android CI Pipeline,iOS CI Pipeline,ONNX Runtime React Native CI Pipeline

azure-pipelines · 2024-03-06T17:43:30Z

Azure Pipelines successfully started running 10 pipeline(s).

### Description - Adding CUDA NHWC support for SpaceToDepth and DepthToSpace - Add a new test which verifies that swizzling SpaceToDepth swizzling for the H axis is correct. - If CUDA NHWC is enabled, run all tests on the CUDA EP with NHWC as well. ### Motivation and Context Adding more NHWC operations to avoid layout transformations when using the CUDA EP for more efficiency.

hariharans29 · 2024-03-13T02:01:06Z

onnxruntime/test/providers/cpu/tensor/upsample_op_test.cc

@@ -692,7 +692,7 @@ TEST(UpsampleOpTest, NhwcUpsampleOp4D1CBilinearTest) {
  // TensorRT: results mismatch
  // ROCm: results mismatch
  test.Run(OpTester::ExpectResult::kExpectSuccess, "",
-           {kCudaExecutionProvider, kTensorrtExecutionProvider, kRocmExecutionProvider});
+           {kCudaExecutionProvider, kCudaNHWCExecutionProvider, kTensorrtExecutionProvider, kRocmExecutionProvider});


If we are going through this approach for testing - should we get rid of the tests (or bring them over here) in tests/providers/cuda/nhwc ?

The CPU tests tests against all backends except for the excluded ones - and those have been excluded for a reason. We should do the reverse, move cuda/nhwc tests over to the cpu tests for test cases which are not yet covered and remove the specialized nhwc tests.

I agree. We should "unify" (i..e) pick up tests from the nhwc tests over to the cpu tests (after de-deuplication) and get rid of the nhwc tests.

mtavenrath force-pushed the nhwc_depth_to_space branch 3 times, most recently from b683eaf to 521fbbb Compare February 26, 2024 10:34

tianleiwu reviewed Feb 28, 2024

View reviewed changes

onnxruntime/core/providers/cuda/tensor/space_depth_ops.cc Outdated Show resolved Hide resolved

mtavenrath changed the title ~~Add SpaceToDepth CUDA NHWC Ops for Opset 1-12~~ Add SpaceToDepth CUDA NHWC Ops for Opset 1-12 and DepthToSpace NHWC OPs Mar 1, 2024

mtavenrath changed the title ~~Add SpaceToDepth CUDA NHWC Ops for Opset 1-12 and DepthToSpace NHWC OPs~~ Add SpaceToDepth and DepthToSpace CUDA NHWC Ops Mar 1, 2024

mtavenrath added 7 commits March 5, 2024 21:38

Add NHWC support for DepthToSpace

4e06459

Remove unused batchsize

5eefcfb

Move NHWC kernel registration of SpaceToDepth/DepthToSpace kernels to…

eb51034

… cuda_nhwc_kernels

Disable flaky NHWC tests, fix linter issues.

8cf25b6

Disable NHWC Tests for ConvTransposeTest.*. Fix Lintrunner again

16bb8e3

Disable PoolTest.* for CudaNHWCExecutionProvider as it's crashing and…

0dc96b8

… failing tests.

Disable all tests failing with the CudaNHWCExecutionProvider and run …

ca45425

…linter

mtavenrath force-pushed the nhwc_depth_to_space branch from 8330ce3 to ca45425 Compare March 5, 2024 20:41

tianleiwu reviewed Mar 5, 2024

View reviewed changes

onnxruntime/core/providers/cuda/tensor/space_depth_ops.cc Show resolved Hide resolved

tianleiwu reviewed Mar 5, 2024

View reviewed changes

onnxruntime/core/providers/cuda/tensor/space_depth_ops.cc Show resolved Hide resolved

tianleiwu reviewed Mar 5, 2024

View reviewed changes

onnxruntime/core/providers/cuda/tensor/space_depth_ops.cc Outdated Show resolved Hide resolved

Disabling more failing tests & new linter issues

54ac1cb

tianleiwu approved these changes Mar 6, 2024

View reviewed changes

tianleiwu merged commit f2dc725 into microsoft:main Mar 6, 2024
77 checks passed

hariharans29 reviewed Mar 13, 2024

View reviewed changes

Add SpaceToDepth and DepthToSpace CUDA NHWC Ops #19646

Add SpaceToDepth and DepthToSpace CUDA NHWC Ops #19646

Conversation

mtavenrath commented Feb 26, 2024 • edited Loading

Description

Motivation and Context

tianleiwu commented Feb 26, 2024

tianleiwu commented Feb 26, 2024

tianleiwu commented Feb 26, 2024

azure-pipelines bot commented Feb 26, 2024

azure-pipelines bot commented Feb 26, 2024

azure-pipelines bot commented Feb 26, 2024

tianleiwu commented Mar 1, 2024

tianleiwu commented Mar 1, 2024

tianleiwu commented Mar 1, 2024

azure-pipelines bot commented Mar 1, 2024

azure-pipelines bot commented Mar 1, 2024

azure-pipelines bot commented Mar 1, 2024

tianleiwu commented Mar 4, 2024

tianleiwu commented Mar 4, 2024

tianleiwu commented Mar 4, 2024

azure-pipelines bot commented Mar 4, 2024

azure-pipelines bot commented Mar 4, 2024

azure-pipelines bot commented Mar 4, 2024

tianleiwu commented Mar 4, 2024

tianleiwu commented Mar 4, 2024

tianleiwu commented Mar 4, 2024

azure-pipelines bot commented Mar 4, 2024

azure-pipelines bot commented Mar 4, 2024

azure-pipelines bot commented Mar 4, 2024

tianleiwu commented Mar 5, 2024

tianleiwu commented Mar 5, 2024

tianleiwu commented Mar 5, 2024

azure-pipelines bot commented Mar 5, 2024

azure-pipelines bot commented Mar 5, 2024

azure-pipelines bot commented Mar 5, 2024

tianleiwu commented Mar 5, 2024

tianleiwu commented Mar 5, 2024

tianleiwu commented Mar 5, 2024

azure-pipelines bot commented Mar 5, 2024

azure-pipelines bot commented Mar 5, 2024

tianleiwu commented Mar 6, 2024

tianleiwu commented Mar 6, 2024

azure-pipelines bot commented Mar 6, 2024

hariharans29 Mar 13, 2024

Choose a reason for hiding this comment

mtavenrath Mar 13, 2024

Choose a reason for hiding this comment

hariharans29 Mar 13, 2024

Choose a reason for hiding this comment

mtavenrath commented Feb 26, 2024 •

edited

Loading