Skip to content

Intel DML v1.2.0

Latest
Compare
Choose a tag to compare
@abdelrahim-hentabli abdelrahim-hentabli released this 26 Sep 23:04
f59ed47

Functionality

  • Introduced a new internal submission mechanism for platforms based on Linux* OS kernel versions where MMAP is no longer permitted. For more details, refer to the Intel Security Advisory. When MMAP is unavailable, the write system call is used instead. This may introduce additional overhead for smaller data sizes (smaller than 16KB), that results in slightly higher Latency and lower Throughput.
  • Updated the DML device search mechanism to a new default behavior. Now, the platforms with Sub-NUMA clustering configured such that not all NUMA nodes have an accelerator instance can utilize any DSA instance from the same socket for execution. If more fine-grained control is needed, the Low-Level API of the library provides the ability to select devices from a specific NUMA node using the numa_id field in the job structure.
  • Introduced a new Low-Level API function dml_batch_get_crc() which retrieves the resulting CRC from a CRC operation.

Usability and Documentation

  • Extended examples to use new operation dml_batch_get_crc() and also to clarify use of crc seed for CRC operation.

Known Limitations

  • Intel(R) DML could be built from directly downloadable files (.tar, .tgz) only without tests and benchmark frameworks, using the -DDML_BUILD_TESTS=OFF build option since they require submodules that are not included in the archives by GitHub* during release creation.
  • Delta Record operations are not currently supported on the hardware_path.
  • Batch operation is currently not supported for the platforms based on Linux* OS kernel versions where MMAP is not permitted.
  • Known test failures are listed below:
    • block_on_fault/apply_delta_page_fault.read/1