Skip to content

v2.3.1

Compare
Choose a tag to compare
@vpirogov vpirogov released this 04 Aug 20:43
· 27 commits to rls-v2.3 since this release

This is a patch release containing the following changes to v2.3:

  • Improved int8 GEMM performance for processors with Intel AVX2 and Intel DL Boost support (f5c071b)
  • Fixed integer overflow for inner product implementation on CPUs (66971b5)
  • Fixed out of bounds access in GEMM implementation for Intel SSE 4.1 (4e81df0)
  • Fixed correctness issue for depthwise convolution post-op with non-default scales on CPUs (783e1d6, 066c832)
  • Fixed crash for s8 binary primitive on Windows (d9fd397)
  • Fixed performance regression in fp32 to u8 reorder for Intel AMX specific memory formats (97f40cf, 532648a)
  • Fixed correctness issue for bfloat16 convolution weight gradient on processors with Intel AMX support (053406d, 6649b75)
  • Fixed correctness issue for bfloat16 inner product backpropagation on processors with Intel AMX support (a2e6c55)
  • Fixed correctness issue for bfloat16 convolution with padded memory formats on GEN9 GPUs (c0aea07)
  • Fixed correctness issue for int8 matmul primitive with zero points on processors with Intel AMX support (55cb716)
  • Fixed segfault in depthwise convolution post-op on CPUs (ad46635)