
Add an option in OpenVINOProviderOptions to support the queue-based overload for creating ClContext #19699

Open · wants to merge 2 commits into main

Conversation

fireyoshiqc

Description

Currently, the OpenVINO EP only provides a way to share an OpenCL context (for IO buffering) through a context pointer, given in its provider options (either the struct- or string-map-based API).

This is problematic when one wants to share a specific OpenCL queue instead: there is no way to do so through the current API, even though OpenVINO itself provides a ClContext constructor overload for it.

This PR addresses the issue by adding an option to the OpenVINO EP provider options that enables the use of that second overload, making it possible to explicitly share OpenCL command queues when using the OpenVINO EP.

Motivation and Context

As described in issue #19697.

@fireyoshiqc
Author

@microsoft-github-policy-service agree

@jywu-msft
Member

+@sfatimar, @preetha-intel

@jywu-msft
Member

/azp run Linux OpenVINO CI Pipeline


Azure Pipelines successfully started running 1 pipeline(s).

@@ -623,7 +623,8 @@ typedef struct OrtOpenVINOProviderOptions {
        cache_dir{},
        context{},
        enable_opencl_throttling{},
-       enable_dynamic_shapes{} {}
+       enable_dynamic_shapes{},
+       queue{} {}
Contributor

The struct is frozen and it's our legacy API. Kindly upgrade to the ORT ProviderOptions map structure.

@preetha-intel
Contributor

@fireyoshiqc Can you explain the use case behind the OpenCL queue option and its impact compared to using the OpenCL context?

@fireyoshiqc
Author

fireyoshiqc commented Mar 6, 2024

@fireyoshiqc Can you explain the use case behind the OpenCL queue option and its impact compared to using the OpenCL context?

@preetha-intel
Sure thing. In the project I'm working on (a camera image-processing pipeline), we use the OpenVINO API 2.0 to share our existing OpenCL queue (used by OpenCV and previously set up on the Intel iGPU) when creating a ClContext for inference on the GPU:

// OpenVINO headers:
    /**
     * @brief Constructs context object from user-supplied OpenCL context handle
     * @param core A reference to OpenVINO Runtime Core object
     * @param queue An OpenCL queue to be used to create shared remote context. Queue will be reused inside the plugin.
     * @note Only latency mode is supported for such context sharing case.
     */
    ClContext(Core& core, cl_command_queue queue) {
        cl_context ctx;
        auto res = clGetCommandQueueInfo(queue, CL_QUEUE_CONTEXT, sizeof(cl_context), &ctx, nullptr);
        OPENVINO_ASSERT(res == CL_SUCCESS, "Can't get context from given opencl queue");
        AnyMap context_params = {{ov::intel_gpu::context_type.name(), ov::intel_gpu::ContextType::OCL},
                                 {ov::intel_gpu::ocl_context.name(), static_cast<gpu_handle_param>(ctx)},
                                 {ov::intel_gpu::ocl_queue.name(), static_cast<gpu_handle_param>(queue)}};
        *this = core.create_context(device_name, context_params).as<ClContext>();
    }

// In our code:
	if( inferOnGPU )
	{
		auto openCvOpenClQueue{ cv::ocl::OpenCLExecutionContext::getCurrent().getQueue() };

		// create the context from the current command queue so that OpenVINO can infer Async with correct scheduling.
		context = std::make_shared<ov::intel_gpu::ocl::ClContext>( core, static_cast<cl_command_queue>( openCvOpenClQueue.ptr() ) );
	}

We wanted to migrate to ONNX Runtime with the OpenVINO EP to standardize our API across different backends. However, this constructor isn't reachable through the provider options, since only a context can be supplied, not a queue.
Using this constructor overload lets us ensure that async inference is correctly scheduled after previous GPU processing operations, and that its results can safely be consumed by subsequent GPU operations without explicitly waiting (.wait()) on the inference request. We use multiple queues (to reduce image-copy overhead, for example), so it's important for us to ensure that the right queue is used.

@jywu-msft jywu-msft added the ep:OpenVINO issues related to OpenVINO execution provider label Mar 7, 2024