Connecting fp16xq4 gemm kernels (optimized for A100) to MatMulNBits<fp16> operator #21083
Azure Pipelines / Big Models
succeeded
Jul 10, 2024 in 1h 56m 26s
Build #20240710.23 succeeded