
Distinguish between DML and the generic 'GPU' term. This is needed for packaging DML EP in the same ORT GPU pkg. #22597

Closed
pranavsharma wants to merge 20 commits from the package_dml branch

Conversation

pranavsharma
Contributor

@pranavsharma pranavsharma commented Oct 25, 2024

Description

We want to package the DML EP in the same package as the ORT GPU package. This is one of the changes required to do so; two other changes are also required.

Motivation and Context

Users want the DML, TensorRT and CUDA EPs in the same ORT GPU package so that they can easily switch between them.
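Once all three EPs ship in one package, an application could pick an EP at session-creation time. The helper below is a hypothetical sketch (not part of the ORT API): it orders a preference list against whatever providers a build reports as available; with onnxruntime installed, `available` would come from `onnxruntime.get_available_providers()` and the result would be passed as `InferenceSession(model_path, providers=...)`.

```python
# Provider names as reported by ONNX Runtime, in preference order.
PREFERENCE = [
    "TensorrtExecutionProvider",
    "CUDAExecutionProvider",
    "DmlExecutionProvider",
    "CPUExecutionProvider",
]

def choose_providers(available):
    """Return the preferred providers that are actually available,
    keeping CPU as the final fallback."""
    chosen = [p for p in PREFERENCE if p in available]
    if "CPUExecutionProvider" not in chosen:
        chosen.append("CPUExecutionProvider")
    return chosen
```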

snnn
snnn previously approved these changes Oct 25, 2024
fdwr
fdwr previously approved these changes Oct 25, 2024
Contributor

@fdwr fdwr left a comment


Looks okay to me. FYI @sumitsays.

@pranavsharma pranavsharma dismissed stale reviews from fdwr and snnn via 381f55f October 25, 2024 19:36
fdwr
fdwr previously approved these changes Oct 25, 2024
Contributor

@fdwr fdwr left a comment


👍 Sumit/Patrice, do you have any concerns?

@pranavsharma
Contributor Author

> 👍 Sumit/Patrice, do you have any concerns?

Looks like there are some DML test failures. I'm investigating. @sumitsays / @fdwr / @PatriceVignola: if something looks obvious, let me know. Thanks!

@tianleiwu
Contributor

Do we plan to support using the DML and CUDA EPs in the same session? If so, some parts, like GPU data transfer, need to support transferring tensors between DML and CUDA memory. If not, we might need to add a check to prevent that usage.

Contributor

@github-actions github-actions bot left a comment


You can commit the suggested changes from lintrunner.

Suggested change in onnxruntime/python/onnxruntime_pybind_ortvalue.cc (resolved)
@snnn
Member

snnn commented Oct 29, 2024

> Do we plan to support using the DML and CUDA EPs in the same session? If so, some parts, like GPU data transfer, need to support transferring tensors between DML and CUDA memory. If not, we might need to add a check to prevent that usage.

No. They cannot be both enabled in the same process.
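Since the two EPs cannot be enabled together in the same process, a load-time guard along these lines could reject the combination up front. This is a hypothetical sketch, not the actual ORT check:

```python
# GPU execution providers that must not coexist in one process.
MUTUALLY_EXCLUSIVE = {"DmlExecutionProvider", "CUDAExecutionProvider"}

def validate_providers(requested):
    """Raise if mutually exclusive GPU providers are requested together;
    otherwise return the request unchanged."""
    conflict = MUTUALLY_EXCLUSIVE & set(requested)
    if len(conflict) > 1:
        raise ValueError(
            "DML and CUDA execution providers cannot both be enabled in "
            f"the same process; requested: {sorted(conflict)}"
        )
    return list(requested)
```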

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
@pranavsharma pranavsharma force-pushed the package_dml branch 2 times, most recently from 737a9c6 to a5e52fa Compare October 29, 2024 06:40
Contributor

@github-actions github-actions bot left a comment


You can commit the suggested changes from lintrunner.

Suggested change in onnxruntime/python/onnxruntime_pybind_ortvalue.cc (resolved)
fs-eire and others added 6 commits October 29, 2024 04:01
### Description

This change resolves issue No. 3 described in #22615.

### Description
* Leverage the `common-variables.yml` template and reduce usage of hardcoded
trt_version

https://github.com/microsoft/onnxruntime/blob/8391b24447fcca4c01599b3270255fbf76ac8a21/tools/ci_build/github/azure-pipelines/templates/common-variables.yml#L2-L7
* Across all CI YAMLs, this PR reduces hardcoded trt_version occurrences
from 40 to 6 by importing trt_version from `common-variables.yml`
* Apply TRT 10.5 and re-enable the control flow op test


### Motivation and Context
- Reduce hardcoded trt_version usage across all CI YAMLs

### Next refactor PR
Will reduce hardcoded trt_version usage in `.dockerfile`, `.bat`, and the
remaining 2 yml files (download_win_gpu_library.yml and set-winenv.yml,
which are step-template YAMLs that can't import variables)
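The refactor pattern is roughly the following sketch. The variable name `common_trt_version` is illustrative; see the linked `common-variables.yml` for the actual variable names:

```yaml
# In a pipeline yml: import the shared variables template instead of
# hardcoding the TensorRT version in each file.
variables:
  - template: templates/common-variables.yml

steps:
  - script: echo "Using TensorRT $(common_trt_version)"
    displayName: Print TRT version
```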
### Description
Allow some classes to be default constructed.
The effect is the same as constructing them with nullptr.
Make the default ctor visible from the base classes.

### Motivation and Context
Multiple customers complained that when storing Ort::Value in a
std::vector, the vector cannot be resized.

We enable that by allowing Ort::Value to be default constructed.
### Description
The issue can happen with multiple sessions when ETW captureState /
rundown is triggered.

Resolves a use-after-free issue.

Tested with a local unit test that creates/destroys multiple sessions while
continually enabling and disabling ETW. The test currently requires an
Admin prompt, so it is not checked in.

### Motivation and Context
ORT should not crash
…mers/models/whisper (#22641)

Bumps [onnx](https://github.com/onnx/onnx) from 1.16.1 to 1.17.0.
<details>
<summary>Release notes</summary>
<p><em>Sourced from <a
href="https://github.com/onnx/onnx/releases">onnx's
releases</a>.</em></p>
<blockquote>
<h2>v1.17.0</h2>
<p>ONNX v1.17.0 is now available with exciting new features! We would
like to thank everyone who contributed to this release!
Please visit <a href="https://onnx.ai/">onnx.ai</a> to learn more about
ONNX and associated projects.</p>
<h1>Key Updates</h1>
<h2>ai.onnx Opset 22</h2>
<ul>
<li>Update to support bfloat16:
<ul>
<li><a
href="https://onnx.ai/onnx/operators/onnx__Acos.html#acos-22">Acos</a>,
<a
href="https://onnx.ai/onnx/operators/onnx__Acosh.html#acosh-22">Acosh</a>,
<a
href="https://onnx.ai/onnx/operators/onnx__Asin.html#asin-22">Asin</a>,
<a
href="https://onnx.ai/onnx/operators/onnx__Asinh.html#asinh-22">Asinh</a>,
<a
href="https://onnx.ai/onnx/operators/onnx__Atan.html#atan-22">Atan</a>,
<a
href="https://onnx.ai/onnx/operators/onnx__Atanh.html#atanh-22">Atanh</a>,
<a
href="https://onnx.ai/onnx/operators/onnx__AveragePool.html#averagepool-22">AveragePool</a>,
<a
href="https://onnx.ai/onnx/operators/onnx__Bernoulli.html#bernoulli-22">Bernoulli</a>,
<a
href="https://onnx.ai/onnx/operators/onnx__Conv.html#conv-22">Conv</a>,
<a
href="https://onnx.ai/onnx/operators/onnx__ConvTranspose.html#convtranspose-22">ConvTranspose</a>,
<a href="https://onnx.ai/onnx/operators/onnx__Cos.html#cos-22">Cos</a>,
<a
href="https://onnx.ai/onnx/operators/onnx__Cosh.html#cosh-22">Cosh</a>,
<a
href="https://onnx.ai/onnx/operators/onnx__DeformConv.html#deformconv-22">DeformConv</a>,
<a href="https://onnx.ai/onnx/operators/onnx__Det.html#det-22">Det</a>,
<a
href="https://onnx.ai/onnx/operators/onnx__Dropout.html#dropout-22">Dropout</a>,
<a href="https://onnx.ai/onnx/operators/onnx__Elu.html#elu-22">Elu</a>,
<a
href="https://onnx.ai/onnx/operators/onnx__EyeLike.html#eyelike-22">EyeLike</a>,
<a href="https://onnx.ai/onnx/operators/onnx__GRU.html#gru-22">GRU</a>,
<a
href="https://onnx.ai/onnx/operators/onnx__GlobalAveragePool.html#globalaveragepool-22">GlobalAveragePool</a>,
<a
href="https://onnx.ai/onnx/operators/onnx__GlobalLpPool.html#globallppool-22">GlobalLpPool</a>,
<a
href="https://onnx.ai/onnx/operators/onnx__GlobalMaxPool.html#globalmaxpool-22">GlobalMaxPool</a>,
<a
href="https://onnx.ai/onnx/operators/onnx__GridSample.html#gridsample-22">GridSample</a>,
<a
href="https://onnx.ai/onnx/operators/onnx__HardSigmoid.html#hardsigmoid-22">HardSigmoid</a>,
<a
href="https://onnx.ai/onnx/operators/onnx__HardSwish.html#hardswish-22">HardSwish</a>,
<a
href="https://onnx.ai/onnx/operators/onnx__InstanceNormalization.html#instancenormalization-22">InstanceNormalization</a>,
<a
href="https://onnx.ai/onnx/operators/onnx__LSTM.html#lstm-22">LSTM</a>,
<a
href="https://onnx.ai/onnx/operators/onnx__LpNormalization.html#lpnormalization-22">LpNormalization</a>,
<a
href="https://onnx.ai/onnx/operators/onnx__LpPool.html#lppool-22">LpPool</a>,
<a
href="https://onnx.ai/onnx/operators/onnx__MaxPool.html#maxpool-22">MaxPool</a>,
<a
href="https://onnx.ai/onnx/operators/onnx__MaxRoiPool.html#maxroipool-22">MaxRoiPool</a>,
<a
href="https://onnx.ai/onnx/operators/onnx__MaxUnpool.html#maxunpool-22">MaxUnpool</a>,
<a
href="https://onnx.ai/onnx/operators/onnx__Mish.html#mish-22">Mish</a>,
<a
href="https://onnx.ai/onnx/operators/onnx__Multinomial.html#multinomial-22">Multinomial</a>,
<a
href="https://onnx.ai/onnx/operators/onnx__NegativeLogLikelihoodLoss.html#negativeloglikelihoodloss-22">NegativeLogLikelihoodLoss</a>,
<a href="https://onnx.ai/onnx/operators/onnx__RNN.html#rnn-22">RNN</a>,
<a
href="https://onnx.ai/onnx/operators/onnx__RandomNormal.html#randomnormal-22">RandomNormal</a>,
<a
href="https://onnx.ai/onnx/operators/onnx__RandomNormalLike.html#randomnormallike-22">RandomNormalLike</a>,
<a
href="https://onnx.ai/onnx/operators/onnx__RandomUniform.html#randomuniform-22">RandomUniform</a>,
<a
href="https://onnx.ai/onnx/operators/onnx__RandomUniformLike.html#randomuniformlike-22">RandomUniformLike</a>,
<a
href="https://onnx.ai/onnx/operators/onnx__RoiAlign.html#roialign-22">RoiAlign</a>,
<a
href="https://onnx.ai/onnx/operators/onnx__Round.html#round-22">Round</a>,
<a
href="https://onnx.ai/onnx/operators/onnx__Selu.html#selu-22">Selu</a>,
<a href="https://onnx.ai/onnx/operators/onnx__Sin.html#sin-22">Sin</a>,
<a
href="https://onnx.ai/onnx/operators/onnx__Sinh.html#sinh-22">Sinh</a>,
<a
href="https://onnx.ai/onnx/operators/onnx__Softplus.html#softplus-22">Softplus</a>,
<a
href="https://onnx.ai/onnx/operators/onnx__Softsign.html#softsign-22">Softsign</a>,
<a href="https://onnx.ai/onnx/operators/onnx__Tan.html#tan-22">Tan</a>,
<a
href="https://onnx.ai/onnx/operators/onnx__ThresholdedRelu.html#thresholdedrelu-22">ThresholdedRelu</a></li>
</ul>
</li>
</ul>
<h2>Python Changes</h2>
<ul>
<li>Support for numpy &gt;= 2.0</li>
</ul>
<h1>Bug fixes and infrastructure improvements</h1>
<ul>
<li>Fix Check URLs errors <a
href="https://redirect.github.com/onnx/onnx/pull/5972">5972</a></li>
<li>Use CMAKE_PREFIX_PATH in finding libprotobuf <a
href="https://redirect.github.com/onnx/onnx/pull/5975">5975</a></li>
<li>Bump main VERSION_NUMBER to 1.17.0 <a
href="https://redirect.github.com/onnx/onnx/pull/5968">5968</a></li>
<li>Fix source and pip tar.gz builds on s390x systems <a
href="https://redirect.github.com/onnx/onnx/pull/5984">5984</a></li>
<li>Fix unique_name <a
href="https://redirect.github.com/onnx/onnx/pull/5992">5992</a></li>
<li>Fix SegFault bug in shape inference <a
href="https://redirect.github.com/onnx/onnx/pull/5990">5990</a></li>
<li>Fix onnx.compose when connecting subgraphs <a
href="https://redirect.github.com/onnx/onnx/pull/5991">5991</a></li>
<li>Fix conversion from split 11 to split 18 <a
href="https://redirect.github.com/onnx/onnx/pull/6020">6020</a></li>
<li>Update error messages for NegativeLogLikelihoodLoss inference
function <a
href="https://redirect.github.com/onnx/onnx/pull/6021">6021</a></li>
<li>Generalize input/output number check in shape inference <a
href="https://redirect.github.com/onnx/onnx/pull/6005">6005</a></li>
<li>Replace rank inference with shape inference for Einsum op <a
href="https://redirect.github.com/onnx/onnx/pull/6010">6010</a></li>
<li>build from source instruction with latest cmake change <a
href="https://redirect.github.com/onnx/onnx/pull/6038">6038</a></li>
<li>Handle OneHot's depth value during shape inference <a
href="https://redirect.github.com/onnx/onnx/pull/5963">5963</a></li>
<li>Not to install cmake in pyproject.toml on Windows <a
href="https://redirect.github.com/onnx/onnx/pull/6045">6045</a></li>
<li>fix a skipped shape infer code <a
href="https://redirect.github.com/onnx/onnx/pull/6049">6049</a></li>
<li>Include the &quot;.onnxtext&quot; extension in supported
serialization format <a
href="https://redirect.github.com/onnx/onnx/pull/6051">6051</a></li>
<li>Allow ReferenceEvaluator to return intermediate results <a
href="https://redirect.github.com/onnx/onnx/pull/6066">6066</a></li>
<li>Fix 1 typo in numpy_helper.py <a
href="https://redirect.github.com/onnx/onnx/pull/6041">6041</a></li>
<li>Remove benchmarking code <a
href="https://redirect.github.com/onnx/onnx/pull/6076">6076</a></li>
<li>Prevent crash on import after GCC 8 builds <a
href="https://redirect.github.com/onnx/onnx/pull/6048">6048</a></li>
<li>Check graph outputs are defined <a
href="https://redirect.github.com/onnx/onnx/pull/6083">6083</a></li>
<li>Enable additional ruff rules <a
href="https://redirect.github.com/onnx/onnx/pull/6032">6032</a></li>
<li>Add missing shape inference check for DequantizeLinear <a
href="https://redirect.github.com/onnx/onnx/pull/6080">6080</a></li>
<li>Add bfloat16 to all relevant ops <a
href="https://redirect.github.com/onnx/onnx/pull/6099">6099</a></li>
<li>fix(ci): install python dependencies with --only-binary :all: in
manylinux <a
href="https://redirect.github.com/onnx/onnx/pull/6120">6120</a></li>
<li>fix: install google-re2 with --only-binary option <a
href="https://redirect.github.com/onnx/onnx/pull/6129">6129</a></li>
<li>Specify axis parameter for DequantizeLinear when input rank is 1 <a
href="https://redirect.github.com/onnx/onnx/pull/6095">6095</a></li>
<li>Pin onnxruntime to 1.17.3 for release CIs <a
href="https://redirect.github.com/onnx/onnx/pull/6143">6143</a></li>
<li>Fix INT4 TensorProto byte size is 5x larger than expected with
negative values <a
href="https://redirect.github.com/onnx/onnx/pull/6161">6161</a></li>
<li>Mitigate tarball directory traversal risks <a
href="https://redirect.github.com/onnx/onnx/pull/6164">6164</a></li>
<li>Fix reference implementation for ScatterND with 4D tensors <a
href="https://redirect.github.com/onnx/onnx/pull/6174">6174</a></li>
<li>Addition of group &gt; 1 in test and in backend for ConvTranspose <a
href="https://redirect.github.com/onnx/onnx/pull/6175">6175</a></li>
<li>Support for bfloat16 for binary, unary operators in reference
implementation <a
href="https://redirect.github.com/onnx/onnx/pull/6166">6166</a></li>
<li>Refactor windows workflow to work on standard windows <a
href="https://redirect.github.com/onnx/onnx/pull/6190">6190</a></li>
<li>Fix a few crashes while running shape inference <a
href="https://redirect.github.com/onnx/onnx/pull/6195">6195</a></li>
<li>Update onnx to work with numpy&gt;=2.0 <a
href="https://redirect.github.com/onnx/onnx/pull/6196">6196</a></li>
<li>Use sets to improve performance of dfs search <a
href="https://redirect.github.com/onnx/onnx/pull/6213">6213</a></li>
</ul>
<!-- raw HTML omitted -->
</blockquote>
<p>... (truncated)</p>
</details>
<details>
<summary>Commits</summary>
<ul>
<li><a
href="https://github.com/onnx/onnx/commit/b8baa8446686496da4cc8fda09f2b6fe65c2a02c"><code>b8baa84</code></a>
Set version 1.17.0 for official release (<a
href="https://redirect.github.com/onnx/onnx/issues/6405">#6405</a>)</li>
<li><a
href="https://github.com/onnx/onnx/commit/6d77b808217f442170d105131836aa4820c0f43f"><code>6d77b80</code></a>
[Cherry-Pick] Fix main url checks (<a
href="https://redirect.github.com/onnx/onnx/issues/6312">#6312</a>) (<a
href="https://redirect.github.com/onnx/onnx/issues/6327">#6327</a>)</li>
<li><a
href="https://github.com/onnx/onnx/commit/174938d8b7d48f27b5c491626c6a474f5f5b829a"><code>174938d</code></a>
[Cherry-Pick] Fix protobuf pkg 5.28.0 failing on Windows (<a
href="https://redirect.github.com/onnx/onnx/issues/6342">#6342</a>) (<a
href="https://redirect.github.com/onnx/onnx/issues/6347">#6347</a>)</li>
<li><a
href="https://github.com/onnx/onnx/commit/f18d5931adc7b44ae5a2afd74e21ed51bcf2bc63"><code>f18d593</code></a>
[Cherry-Pick] Remove unused variables (<a
href="https://redirect.github.com/onnx/onnx/issues/6303">#6303</a>) (<a
href="https://redirect.github.com/onnx/onnx/issues/6324">#6324</a>)</li>
<li><a
href="https://github.com/onnx/onnx/commit/c58890537f466b9b294f6dd038dd826f9907e03d"><code>c588905</code></a>
Set version in rel-1.17.0 to 1.17.0rc1 (<a
href="https://redirect.github.com/onnx/onnx/issues/6317">#6317</a>)</li>
<li><a
href="https://github.com/onnx/onnx/commit/4392c2c9ae30cd10d199bd31fc7b272a6f842824"><code>4392c2c</code></a>
Prepare for rel-1.17.0 (<a
href="https://redirect.github.com/onnx/onnx/issues/6281">#6281</a>)</li>
<li><a
href="https://github.com/onnx/onnx/commit/cb54169e4f2b52861cf5ec546d244ea4b2d09964"><code>cb54169</code></a>
Update ort filter to 1.20.0 to skip tests known to fail with ort 1.19.0
(<a
href="https://redirect.github.com/onnx/onnx/issues/6306">#6306</a>)</li>
<li><a
href="https://github.com/onnx/onnx/commit/99e1fd352c05c3176770080824fd7a8c474c97c0"><code>99e1fd3</code></a>
Bump reviewdog/action-misspell from 1.21.0 to 1.23.0 (<a
href="https://redirect.github.com/onnx/onnx/issues/6268">#6268</a>)</li>
<li><a
href="https://github.com/onnx/onnx/commit/19205655059e1654ba2d44478bc3a1c75af7830f"><code>1920565</code></a>
Bump ossf/scorecard-action from 2.3.3 to 2.4.0 (<a
href="https://redirect.github.com/onnx/onnx/issues/6273">#6273</a>)</li>
<li><a
href="https://github.com/onnx/onnx/commit/2e8f2289b91d5670e1c661ab9119178b24197219"><code>2e8f228</code></a>
Bump mypy from 1.10.1 to 1.11.1 (<a
href="https://redirect.github.com/onnx/onnx/issues/6275">#6275</a>)</li>
<li>Additional commits viewable in <a
href="https://github.com/onnx/onnx/compare/v1.16.1...v1.17.0">compare
view</a></li>
</ul>
</details>
<br />


[![Dependabot compatibility
score](https://dependabot-badges.githubapp.com/badges/compatibility_score?dependency-name=onnx&package-manager=pip&previous-version=1.16.1&new-version=1.17.0)](https://docs.github.com/en/github/managing-security-vulnerabilities/about-dependabot-security-updates#about-compatibility-scores)

Dependabot will resolve any conflicts with this PR as long as you don't
alter it yourself. You can also trigger a rebase manually by commenting
`@dependabot rebase`.

[//]: # (dependabot-automerge-start)
[//]: # (dependabot-automerge-end)

---

<details>
<summary>Dependabot commands and options</summary>
<br />

You can trigger Dependabot actions by commenting on this PR:
- `@dependabot rebase` will rebase this PR
- `@dependabot recreate` will recreate this PR, overwriting any edits
that have been made to it
- `@dependabot merge` will merge this PR after your CI passes on it
- `@dependabot squash and merge` will squash and merge this PR after
your CI passes on it
- `@dependabot cancel merge` will cancel a previously requested merge
and block automerging
- `@dependabot reopen` will reopen this PR if it is closed
- `@dependabot close` will close this PR and stop Dependabot recreating
it. You can achieve the same result by closing it manually
- `@dependabot show <dependency name> ignore conditions` will show all
of the ignore conditions of the specified dependency
- `@dependabot ignore this major version` will close this PR and stop
Dependabot creating any more for this major version (unless you reopen
the PR or upgrade to it yourself)
- `@dependabot ignore this minor version` will close this PR and stop
Dependabot creating any more for this minor version (unless you reopen
the PR or upgrade to it yourself)
- `@dependabot ignore this dependency` will close this PR and stop
Dependabot creating any more for this dependency (unless you reopen the
PR or upgrade to it yourself)
You can disable automated security fix PRs for this repo from the
[Security Alerts
page](https://github.com/microsoft/onnxruntime/network/alerts).

</details>

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
BUG #22031

Optimize two situations:
1. Increase workgroupSize if only one workgroup is dispatched.
2. Avoid transpose if not necessary.

With this PR and PR #22577, the overall time of the demucs model improves
from 154.60 ms to 106.36 ms on my dGPU.
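The first optimization can be sketched as pure dispatch-size logic. This is a hypothetical illustration of the heuristic, not the actual WebGPU EP code; the default and maximum workgroup sizes are assumed values:

```python
DEFAULT_WORKGROUP_SIZE = 64
MAX_WORKGROUP_SIZE = 256  # assumed device limit for illustration

def plan_dispatch(total_invocations):
    """Pick (workgroup_size, num_workgroups). When the default size would
    dispatch only a single workgroup, grow the workgroup so the work is
    spread over more threads instead of a longer per-thread loop."""
    size = DEFAULT_WORKGROUP_SIZE
    num_groups = (total_invocations + size - 1) // size  # ceil division
    if num_groups == 1:
        size = min(MAX_WORKGROUP_SIZE, max(total_invocations, 1))
        num_groups = (total_invocations + size - 1) // size
    return size, num_groups
```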
indygit and others added 10 commits October 29, 2024 17:13
### Description
[DML EP] Update DML to 1.15.4



### Motivation and Context
We want customers to use the latest DirectML.
### JSEP Ops that need updating

- [x] Cast
- [x] ReduceMax
- [x] ReduceMin
- [x] Squeeze
- [x] Unsqueeze
- [x] Transpose
- [x] AveragePool
- [x] Flatten
- [x] Pad
- [x] If
### Description

This PR adds the actual implementation of the WebGPU EP based on
#22318.

This change includes the following:

<details>
<summary><b>core framework of WebGPU EP</b></summary>

  - WebGPU EP factory classes for:
    - handling WebGPU options
    - creating WebGPU EP instance
    - creating WebGPU context
  - WebGPU Execution Provider classes
    - GPU Buffer allocator
    - data transfer
  - Buffer management classes
    - Buffer Manager
    - BufferCacheManager
      - DisabledCacheManager
      - SimpleCacheManager
      - LazyReleaseCacheManager
      - BucketCacheManager
  - Program classes
    - Program (base)
    - Program Cache Key
    - Program Manager
  - Shader helper classes
    - Shader Helper
    - ShaderIndicesHelper
    - ShaderVariableHelper
  - Utils
    - GPU Query based profiler
    - compute context
    - string utils
  - Misc
    - Python binding WebGPU support (basic)
</details>

<details>
<summary><b>Kernel implementation</b></summary>


  - onnx.ai (default opset):
- Elementwise (math): Abs, Neg, Floor, Ceil, Reciprocal, Sqrt, Exp, Erf,
Log, Sin, Cos, Tan, Asin, Acos, Atan, Sinh, Cosh, Asinh, Acosh, Atanh,
Tanh, Not, Cast
- Elementwise (activation): Sigmoid, HardSigmoid, Clip, Elu, Relu,
LeakyRelu, ThresholdedRelu, Gelu
- Binary (math): Add, Sub, Mul, Div, Pow, Equal, Greater,
GreaterOrEqual, Less, LessOrEqual
    - (Tensors): Shape, Reshape, Squeeze, Unsqueeze
    - Where
    - Transpose
    - Concat
    - Expand
    - Gather
    - Tile
    - Range
    - LayerNormalization
  - com.microsoft
    - FastGelu
    - MatMulNBits
    - MultiHeadAttention
    - RotaryEmbedding
    - SkipLayerNormalization
    - LayerNormalization
    - SimplifiedLayerNormalization
    - SkipSimplifiedLayerNormalization

</details>

<details>
<summary><b>Build, test and CI pipeline integration</b></summary>

  - build works for Windows, macOS and iOS
  - support onnxruntime_test_all and python node test
  - added a new unit test for `--use_external_dawn` build flag.
  - updated macOS pipeline to build with WebGPU support
  - added a new pipeline for WebGPU Windows

</details>

This change does not include:

- Node.js binding support for WebGPU (will be a separate PR)
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
@pranavsharma pranavsharma requested a review from a team as a code owner October 30, 2024 06:27