
[QNN EP] Enable option to set QNN context priority #18315

Merged 5 commits into main on Nov 9, 2023
Conversation

HectorSVC
Contributor

@HectorSVC commented Nov 7, 2023


Description

Enable the qnn_context_priority provider option to set the QNN context priority. Supported values: "low", "normal", "normal_high", "high".

This feature ensures that inference submitted at a higher priority is serviced ahead of lower-priority work on the NPU. Tested with the onnxruntime_perf_test tool using the same model; a usage sketch follows the results below.

  1. Run the model on the NPU as a single instance: latency is 300ms.
  2. Run the same model on the NPU with two instances at the same time:
     - Case 1: both at the same priority (high): latency is 600ms for each.
     - Case 2: one at low priority (latency 30,000ms) and one at high priority (latency 300ms).
     - Case 3: one at normal priority (latency 15,000ms) and one at high priority (latency 300ms).
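For reference, a minimal sketch of setting this option from the ONNX Runtime Python API. The `qnn_context_priority` key is the option added by this PR; the model path and the `backend_path` value are placeholders assuming the HTP (NPU) backend on Windows:

```python
# Minimal sketch: create a session on the QNN EP with a high context priority.
# "qnn_context_priority" is the option added by this PR; "model.onnx" and the
# backend_path value are placeholder assumptions for illustration.
import onnxruntime as ort

session = ort.InferenceSession(
    "model.onnx",  # placeholder model path
    providers=["QNNExecutionProvider"],
    provider_options=[{
        "backend_path": "QnnHtp.dll",    # QNN HTP (NPU) backend library
        "qnn_context_priority": "high",  # "low", "normal", "normal_high", or "high"
    }],
)
```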

@HectorSVC merged commit 55c19d6 into main Nov 9, 2023
86 of 90 checks passed
@HectorSVC deleted the qnn_ctx_priority branch November 9, 2023 04:56
kleiti pushed a commit to kleiti/onnxruntime that referenced this pull request Mar 22, 2024