Connecting fp16xq4 gemm kernels (optimized for A100) to MatMulNBits<fp16> operator #28942
Annotations
1 error
Windows-CUDA-12
Canceling since a higher priority waiting request for 'Windows_CI-cfu_transform_prepack' exists
|