v2.3.1
This is a patch release containing the following changes to v2.3:
- Improved int8 GEMM performance for processors with Intel AVX2 and Intel DL Boost support (f5c071b)
- Fixed integer overflow for inner product implementation on CPUs (66971b5)
- Fixed out-of-bounds access in GEMM implementation for Intel SSE 4.1 (4e81df0)
- Fixed correctness issue for depthwise convolution post-op with non-default scales on CPUs (783e1d6, 066c832)
- Fixed crash for s8 binary primitive on Windows (d9fd397)
- Fixed performance regression in fp32 to u8 reorder for Intel AMX-specific memory formats (97f40cf, 532648a)
- Fixed correctness issue for bfloat16 convolution weight gradient on processors with Intel AMX support (053406d, 6649b75)
- Fixed correctness issue for bfloat16 inner product backpropagation on processors with Intel AMX support (a2e6c55)
- Fixed correctness issue for bfloat16 convolution with padded memory formats on GEN9 GPUs (c0aea07)
- Fixed correctness issue for int8 matmul primitive with zero points on processors with Intel AMX support (55cb716)
- Fixed segfault in depthwise convolution post-op on CPUs (ad46635)