Connecting fp16xq4 gemm kernels (optimized for A100) to MatMulNBits<fp16> operator #21083
Azure Pipelines / Big Models (Stable_Diffusion Stable_Diffusion)
succeeded
Jul 10, 2024 in 9m 44s
Stable_Diffusion Stable_Diffusion succeeded
Loading