QNN EP: Context cache only works with single partition #18354
Labels
ep:QNN
issues related to QNN exeution provider
platform:mobile
issues related to ONNX Runtime mobile; typically submitted using template
Describe the issue
The context cache
only works if the graph can be executed in a single partition.
onnxruntime + QNN Execution Provider + libQnnHtp.so backend.
If more than one partition is needed, onnxruntime throws an exception in qnn_execution_provider. This restriction does not exist in other environments such as TfLite + QNN delegate.
To reproduce
Run the model attached to #18353 with the following options:
Urgency
No response
Platform
Android
OS Version
34
ONNX Runtime Installation
Built from Source
ONNX Runtime Version or Commit ID
1.16
ONNX Runtime API
C++
Architecture
ARM64
Execution Provider
Other / Unknown
Execution Provider Library Version
QNN: 2.16
The text was updated successfully, but these errors were encountered: