Skip to content

Latest commit

 

History

History
19 lines (11 loc) · 1.74 KB

File metadata and controls

19 lines (11 loc) · 1.74 KB

Asynchronous Inference with OpenVINO™

Binder Colab

This notebook demonstrates how to use the Async API and AsyncInferQueue for asynchronous execution with OpenVINO.

OpenVINO Runtime supports inference in either synchronous or asynchronous mode. The key advantage of the Async API is that when a device is busy with inference, the application can perform other tasks in parallel (for example, populating inputs or scheduling other requests) rather than wait for the current inference to complete first.

With synchronous mode, we wait for the result of the first inference before sending the next request. While the request is being sent, the hardware is idle. When we use the async API, the transfer of the second request is overlapped with the execution of the first inference, and that prevents any hardware idle time. ​

async vs sync

Installation Instructions

This is a self-contained example that relies solely on its own code.
We recommend running the notebook in a virtual environment. You only need a Jupyter server to start. For details, please refer to Installation Guide.