Asynchronous Inference with OpenVINO™

This notebook demonstrates how to use the Async API and AsyncInferQueue for asynchronous execution with OpenVINO.

OpenVINO Runtime supports inference in either synchronous or asynchronous mode. The key advantage of the Async API is that when a device is busy with inference, the application can perform other tasks in parallel (for example, populating inputs or scheduling other requests) rather than wait for the current inference to complete first.

With synchronous mode, we wait for the result of the first inference before sending the next request. While the request is being sent, the hardware is idle. When we use the async API, the transfer of the second request is overlapped with the execution of the first inference, and that prevents any hardware idle time.

Installation Instructions

This is a self-contained example that relies solely on its own code.
We recommend running the notebook in a virtual environment. You only need a Jupyter server to start. For details, please refer to Installation Guide.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Asynchronous Inference with OpenVINO™

Installation Instructions

Files

README.md

Latest commit

History

README.md

File metadata and controls

Asynchronous Inference with OpenVINO™

Installation Instructions