oneAPI-v1.2-rev-1
rscohn2 released this 10 Nov 17:26
Changes since 1.1
SYCL
The following extensions were added:
- sycl_ext_oneapi_assert - Support for device-side assert.
- sycl_ext_oneapi_default_context - Adds the concept of a platform default context.
- sycl_ext_oneapi_discard_queue_events - Adds a queue property that can optimize queues in some circumstances.
- sycl_ext_oneapi_srgb - Exposes sRGB support for images.
- sycl_ext_oneapi_usm_device_read_only - Adds a property for USM allocations.
oneDPL
The following updates were made to the oneDPL specification for version 1.2:
- The content was reorganized.
- API for random number generation was added.
- Incremental improvements and bug fixes.
oneDNN
This is a new major release of the oneDNN specification, which breaks compatibility with previously published versions.
- oneDNN Graph extension: A graph extension is added to allow seamless fusion of operations and more flexibility for backend-specific optimizations.
- Reworked quantization workflow: To support dynamic quantization efficiently and allow better reuse of primitive objects, quantization parameters are no longer passed at primitive creation, but at primitive execution. This also allows quantization parameters to be passed from device memory instead of host memory.
- Opaque memory descriptors and removal of operation descriptors: This gives oneDNN implementations more flexibility to add new memory layouts and primitive extensions without breaking compatibility.
- Better support for type conversion fusion: All primitives now take separate descriptors for input and output, which allows type conversions to be fused into all primitives.
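The reworked quantization workflow can be pictured with a toy model in plain C++. This is only a sketch under stated assumptions: `ToyQuantizer` and its members are hypothetical names invented for illustration and are not part of the oneDNN API. The point it tries to show is that the object is created once without any scale, and the scale arrives with each `execute` call, so one object can be reused when the scale changes at run time.

```cpp
#include <algorithm>
#include <cmath>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical toy type for illustration only; NOT a oneDNN class.
class ToyQuantizer {
public:
    // Creation fixes only static configuration (here, the saturation range).
    // No quantization scale is supplied at this point.
    ToyQuantizer(int lo, int hi) : lo_(lo), hi_(hi) {}

    // The scale is an execution-time argument, so the same object can be
    // reused with a different scale on every call.
    std::vector<std::int8_t> execute(const std::vector<float>& src,
                                     float scale) const {
        std::vector<std::int8_t> dst;
        dst.reserve(src.size());
        for (float x : src) {
            // Scale, round to nearest, then saturate to [lo_, hi_].
            float q = std::round(x / scale);
            q = std::max(static_cast<float>(lo_),
                         std::min(static_cast<float>(hi_), q));
            dst.push_back(static_cast<std::int8_t>(q));
        }
        return dst;
    }

private:
    int lo_;
    int hi_;
};
```

In this toy model, calling `execute` twice on the same `ToyQuantizer` with scales `0.5f` and `0.25f` requires no re-creation of the object, which mirrors the reuse argument made in the release note.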
Level Zero
oneTBB
The following updates were made to the oneTBB specification for version 1.2:
- Support for core types and thread-per-core limit was added to task_arena constraints.
- API of concurrent_queue and concurrent_bounded_queue was extended to better match C++ standard containers.
- Incremental improvements and bug fixes.
oneVPL
This release updates oneVPL specification to version 2.8.0. New features include:
- Encode statistics API: A new API is introduced to provide access to encoder statistics at frame, tile, slice, or block levels.
- Improved multi-adapter support: Extended device ID reporting is added to provide additional properties for device matching and identification outside of oneVPL.
- Priority loading option for custom libraries: The environment variable ONEVPL_PRIORITY_PATH may be used to force loading of user-defined libraries with highest priority.
- ONEVPL_EXPERIMENTAL macro for introduction of experimental API features: Applications may access experimental API features by defining ONEVPL_EXPERIMENTAL at build time.
- Session initialization controls: New dispatcher configuration properties are introduced to set the number of threads, configure the device copy mode, or attach extension buffers during session initialization.
- Camera Processing API: A new API is introduced for processing Camera RAW data.
- Support for new video color formats: New color formats are added including 4:4:4 XYUV and 16-bit floating-point ABGR.
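Build-time gating of the kind described for ONEVPL_EXPERIMENTAL can be sketched in self-contained C++. Everything in the `demo` namespace below is a hypothetical stand-in and not oneVPL API. The sketch defines the macro inside the file so that it compiles on its own; a real application would more likely pass it on the compiler command line (for example `-DONEVPL_EXPERIMENTAL`) or define it before including the oneVPL headers.

```cpp
#include <string>
#include <vector>

// Defined here only so the sketch is self-contained; normally this comes
// from the build system.
#ifndef ONEVPL_EXPERIMENTAL
#define ONEVPL_EXPERIMENTAL
#endif

namespace demo {  // hypothetical namespace, not part of oneVPL

// Returns the names of the features visible to this translation unit.
inline std::vector<std::string> available_features() {
    std::vector<std::string> features{"stable_feature"};
#ifdef ONEVPL_EXPERIMENTAL
    // Compiled in only when the macro is defined at build time.
    features.push_back("experimental_feature");
#endif
    return features;
}

}  // namespace demo
```

Because the gating happens in the preprocessor, code that does not define the macro never sees the experimental declarations at all, so it cannot come to depend on them by accident.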
oneMKL
The following updates were made to the oneMKL specification for version 1.2:
- Dense matrix copy and transpose routines were added to the BLAS-like extensions.
- Support for half and bfloat16 precision was added to several L1 BLAS routines.
- The supported precisions for BLAS gemm and gemm_batch were updated.
- Several BLAS routines had const attributes properly assigned to arguments.
- A missing constraint on parameter "n" was added for LAPACK orgqr routines.
- The directory tree of the VM, RNG, and Stats domains of oneMKL was improved, and minor issues in RNG were fixed.
- Other changes include minor clarifications and bug fixes.