Connecting fp16xq4 gemm kernels (optimized for A100) to MatMulNBits<fp16> operator #41421
