Commit
[TensorRT EP] Weightless API integration (#20412)
This PR adds the weight-stripped engine feature (thanks to @moraxu for #20214), which is the major feature for TRT 10 integration.

Two TRT EP options are added:

- `trt_weight_stripped_engine_enable`: Enable weight-stripped engine build and refit.
- `trt_onnx_model_folder_path`: In the quick-load case using an embedded engine model (EPContext mode), the original ONNX filename is stored in the node's attribute, and this option specifies the directory of that ONNX file if needed.

Normal weight-stripped engine workflow:

![image](https://github.com/microsoft/onnxruntime/assets/54722500/9f314865-cbda-4979-a7ac-b31c7a553b56)

Weight-stripped engine and quick load workflow:

![image](https://github.com/microsoft/onnxruntime/assets/54722500/9f31db51-a7a8-495b-ba25-54c7f904cbad)

See the doc [here](https://onnxruntime.ai/docs/execution-providers/TensorRT-ExecutionProvider.html#tensorrt-ep-caches) for more information about the EPContext model.

---------

Co-authored-by: yf711 <[email protected]>
Co-authored-by: Ye Wang <[email protected]>
Co-authored-by: Michal Guzek <[email protected]>
Co-authored-by: pengwa <[email protected]>
Co-authored-by: wejoncy <[email protected]>
Co-authored-by: Yi Zhang <[email protected]>
Co-authored-by: Yi Zhang <[email protected]>
Co-authored-by: Pranav Sharma <[email protected]>
Co-authored-by: Adam Pocock <[email protected]>
Co-authored-by: cao lei <[email protected]>
Co-authored-by: Adrian Lizarraga <[email protected]>
Co-authored-by: inisis <[email protected]>
Co-authored-by: Jeff Bloomfield <[email protected]>
Co-authored-by: mo-ja <[email protected]>
Co-authored-by: kunal-vaishnavi <[email protected]>
Co-authored-by: Sumit Agarwal <[email protected]>
Co-authored-by: Atanas Dimitrov <[email protected]>
Co-authored-by: Justin Chu <[email protected]>
Co-authored-by: Yufeng Li <[email protected]>
Co-authored-by: Dhruv Matani <[email protected]>
Co-authored-by: Dhruv Matani <[email protected]>
Co-authored-by: wangshuai09 <[email protected]>
Co-authored-by: Xiaoyu <[email protected]>
Co-authored-by: Xu Xing <[email protected]>
Co-authored-by: Dmitri Smirnov <[email protected]>
Co-authored-by: Rachel Guo <[email protected]>
Co-authored-by: Sai Kishan Pampana <[email protected]>
Co-authored-by: rachguo <[email protected]>
Co-authored-by: Jian Chen <[email protected]>
Co-authored-by: Shubham Bhokare <[email protected]>
Co-authored-by: Yulong Wang <[email protected]>
Co-authored-by: Andrew Fantino <[email protected]>
Co-authored-by: Thomas Boby <[email protected]>
Co-authored-by: Tianlei Wu <[email protected]>
Co-authored-by: Scott McKay <[email protected]>
Co-authored-by: Michal Guzek <[email protected]>
Co-authored-by: George Wu <[email protected]>
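
For readers who want to try the two new options, here is a minimal sketch of how they might be passed as TensorRT EP provider options from the Python API. The option names `trt_weight_stripped_engine_enable` and `trt_onnx_model_folder_path` come from this PR. The helper names `trt_weight_stripped_providers` and `create_session`, the CUDA fallback provider, and any paths are illustrative assumptions, not part of the change.

```python
from typing import Any, Dict, List, Optional


def trt_weight_stripped_providers(onnx_model_folder: Optional[str] = None) -> List[Any]:
    """Build a providers list that enables the weight-stripped engine option.

    Hypothetical helper: only the two option names are taken from the PR.
    """
    trt_options: Dict[str, Any] = {
        # Enable weight-stripped engine build and refit.
        "trt_weight_stripped_engine_enable": True,
    }
    if onnx_model_folder is not None:
        # Directory of the original ONNX file, for the quick-load
        # (EPContext) case where only the filename is stored in the node.
        trt_options["trt_onnx_model_folder_path"] = onnx_model_folder
    # Assumption: fall back to the CUDA EP if TensorRT cannot take a node.
    return [("TensorrtExecutionProvider", trt_options), "CUDAExecutionProvider"]


def create_session(model_path: str, onnx_model_folder: Optional[str] = None):
    """Create an inference session; needs an onnxruntime build with the TensorRT EP."""
    import onnxruntime as ort  # imported lazily so the helper above works without it

    return ort.InferenceSession(
        model_path,
        providers=trt_weight_stripped_providers(onnx_model_folder),
    )
```

A call such as `create_session("model_ctx.onnx", onnx_model_folder="/path/to/original")` would then cover the quick-load case, where both the file name and the folder are placeholders.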
1 parent 439ed92, commit 454fcdd
Showing 13 changed files with 331 additions and 23 deletions.