New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Add device tensor documentation for GPU execution providers #20837

Merged

tianleiwu merged 3 commits into microsoft:gh-pages from gedoensmax:doc_device_tensor

Jun 12, 2024

Contributor

gedoensmax commented May 28, 2024

This documentation adds documentation on:

how to allocate CUDA device tensors from C++ and python
how to use DML device tensors from C++ and python
it also shows how to leverage existing GPU allocations in ORT
how to overlap PCI copies and GPU execution using CUDA streams
how to overlap PCI copies and GPU execution using D3D12 Command Lists and custom resources


          Add device tensor documentation

a222685

Contributor Author

gedoensmax commented May 28, 2024

@PatriceVignola can you help review this for DML ?
@tianleiwu can you help review this for CUDA ?

tianleiwu reviewed

View reviewed changes

docs/performance/device-tensor.md Outdated Show resolved Hide resolved

tianleiwu reviewed

View reviewed changes

docs/performance/device-tensor.md Outdated Show resolved Hide resolved

tianleiwu reviewed

View reviewed changes

docs/performance/device-tensor.md Outdated Show resolved Hide resolved

tianleiwu reviewed

View reviewed changes

docs/performance/device-tensor.md Outdated Show resolved Hide resolved


          Apply suggestions from tianleiwu's review

69db4a2

Co-authored-by: Tianlei Wu <[email protected]>

tianleiwu reviewed

View reviewed changes

docs/performance/device-tensor.md Outdated Show resolved Hide resolved

tianleiwu reviewed

View reviewed changes

docs/performance/device-tensor.md Outdated Show resolved Hide resolved

tianleiwu reviewed

View reviewed changes

docs/performance/device-tensor.md Outdated Show resolved Hide resolved

PatriceVignola reviewed

View reviewed changes

docs/performance/device-tensor.md Outdated Show resolved Hide resolved

PatriceVignola reviewed

View reviewed changes

docs/performance/device-tensor.md Outdated Show resolved Hide resolved

PatriceVignola reviewed

View reviewed changes

docs/performance/tune-performance/iobinding.md Outdated Show resolved Hide resolved

Contributor

PatriceVignola commented May 29, 2024

DirectML part looks good to me.


          fixing typos

7c520f5

tianleiwu approved these changes

View reviewed changes

Contributor Author

gedoensmax commented Jun 12, 2024

When will this be merged ? Is there anything else needed from my side ?

tianleiwu merged commit f20db76 into microsoft:gh-pages

1 of 2 checks passed

tianleiwu mentioned this pull request

[Doc] Fix links in Device Tensor Doc #21039

Merged

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet