Connecting fp16xq4 gemm kernels (optimized for A100) to MatMulNBits<fp16> operator #21083
Azure Pipelines / Big Models (Build_Onnxruntime_Cuda Linux_Build)
succeeded
Jul 10, 2024 in 34m 0s
Build_Onnxruntime_Cuda Linux_Build succeeded
Loading