Enable user to set QNN HTP performance mode for every session run #19521

HectorSVC · 2024-02-14T18:01:02Z

Description

Currently, the QNN HTP performance mode is set during session creation, and there is no way to change it afterwards. There is a requirement to switch to a high performance mode for high-priority requests and to switch back to a low performance mode later to save power, for example when no requests are incoming.

This change still keeps the performance mode at the session level in the QNN EP options, where it is used as the default. The ORT QNN EP sets it once if the user specifies it.
In addition, there are run-option settings (qnn.htp_perf_mode and qnn.htp_perf_mode_post_run) to change the performance mode before and after a session run. The recommended scenario is that the user sets a high performance mode before the inference run to get the result back as soon as possible, and sets a low performance mode after the inference to save power.
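As a rough illustration of the recommended scenario, the sketch below attaches the two run-option keys named above to a RunOptions-like object. The helper `apply_htp_perf_modes` and the stand-in class `RecordingRunOptions` are hypothetical names introduced here, and the mode strings `"burst"` and `"power_saver"` are assumed to be accepted values; check the QNN EP documentation for the supported list.

```python
from typing import Optional

# Run-option keys named in this PR description.
PERF_MODE_KEY = "qnn.htp_perf_mode"
PERF_MODE_POST_RUN_KEY = "qnn.htp_perf_mode_post_run"


def apply_htp_perf_modes(run_options, before: Optional[str] = None,
                         after: Optional[str] = None):
    """Attach per-run HTP performance modes to a RunOptions-like object.

    `run_options` only needs an `add_run_config_entry(key, value)` method,
    which onnxruntime.RunOptions provides in the Python API.
    Hypothetical helper, not part of ONNX Runtime.
    """
    if before is not None:
        # Mode requested before the run starts.
        run_options.add_run_config_entry(PERF_MODE_KEY, before)
    if after is not None:
        # Mode requested once the run has finished.
        run_options.add_run_config_entry(PERF_MODE_POST_RUN_KEY, after)
    return run_options


class RecordingRunOptions:
    """Hypothetical stand-in that records entries, so no QNN device is needed."""

    def __init__(self):
        self.entries = {}

    def add_run_config_entry(self, key, value):
        self.entries[key] = value


# Mode strings are assumptions; see the lead-in above.
demo = apply_htp_perf_modes(RecordingRunOptions(), before="burst", after="power_saver")
print(demo.entries)
```

With a real session, the same helper could be applied to an `onnxruntime.RunOptions()` instance that is then passed as `session.run(None, inputs, run_options=ro)`. Leaving both arguments unset adds no entries, so the session-level default from the QNN EP options would stay in effect.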

pranavsharma

The non-QNN changes look fine.

HectorSVC · 2024-02-15T23:12:02Z

@zhangsibo1129, @FFFrog, could you help take a look at the changes in StreamExecutionContext? I hope they don't impact the CANN EP.

FFFrog · 2024-02-16T02:29:39Z

> @zhangsibo1129, @FFFrog, could you help take a look at the changes in StreamExecutionContext? I hope they don't impact the CANN EP.

Thank you for mentioning it. I am on vacation and will give you feedback in time after reading this tomorrow.

onnxruntime/core/providers/qnn/qnn_execution_provider.cc

jywu-msft · 2024-02-16T16:29:03Z

python lint/format check is failing (not sure why it says python when the output is on basic_test.cc) #Resolved

onnxruntime/test/providers/qnn/qnn_basic_test.cc

onnxruntime/core/providers/qnn/qnn_execution_provider.cc

FFFrog · 2024-02-17T09:11:21Z

> @zhangsibo1129, @FFFrog, could you help take a look at the changes in StreamExecutionContext? I hope they don't impact the CANN EP.

@HectorSVC, everything is OK for the CANN EP. Thank you again.

jslhcl

chiwwang · 2024-02-29T07:39:41Z

Oh, just a note that DCVS can be enabled for power-efficient modes, as the QNN docs suggest.

HectorSVC added 10 commits February 13, 2024 17:25

Enable QNN EP to set HTP power configure for each session run

60017aa

Get default_htp_performance_mode from QNN EP option. set it once only for each thread as default so user don't need to set it for every session run

032bf19

update existing API call for OnRunStart & OnRunEnd to have RunOptions

bfafa7d

fix QNN linux build

0f525c7

fix build errors

91f1c6b

fix build issue for training

5e90641

fix for default perf mode setting

c6dc028

do perf setting for NPU backend only

a318478

fix test failure caused by previous minor changes

f059452

add ORT_DISALLOW_COPY_ASSIGNMENT_AND_MOVE(PerThreadContext)

701e67f

HectorSVC marked this pull request as ready for review February 15, 2024 17:19

HectorSVC requested review from adrianlizarraga, pranavsharma and jywu-msft February 15, 2024 17:19

aungthetnaing approved these changes Feb 15, 2024

pranavsharma reviewed Feb 15, 2024

update stream_execution_context.cc, update UT to run graph 10 times inside the thread

ec22b3b

jslhcl reviewed Feb 16, 2024

onnxruntime/core/providers/qnn/qnn_execution_provider.cc

jslhcl reviewed Feb 16, 2024

onnxruntime/core/providers/qnn/qnn_execution_provider.cc

update UT to set each thread with different perf model for session.run

48a4459

fix format issue

1190702

adrianlizarraga reviewed Feb 16, 2024

onnxruntime/test/providers/qnn/qnn_basic_test.cc

adrianlizarraga reviewed Feb 16, 2024

onnxruntime/core/providers/qnn/qnn_execution_provider.cc

update according review comments.

b4e26bd

jywu-msft reviewed Feb 16, 2024

onnxruntime/core/providers/qnn/qnn_execution_provider.cc

jywu-msft approved these changes Feb 22, 2024

jslhcl approved these changes Feb 23, 2024

HectorSVC merged commit 4ab4976 into main Feb 23, 2024
94 checks passed

HectorSVC deleted the qnn_power_cfg_run_option branch February 23, 2024 01:05
