Skip to content

Connecting fp16xq4 gemm kernels (optimized for A100) to MatMulNBits<fp16> operator #6339

Connecting fp16xq4 gemm kernels (optimized for A100) to MatMulNBits<fp16> operator

Connecting fp16xq4 gemm kernels (optimized for A100) to MatMulNBits<fp16> operator #6339