Adding cuda kernel (optimized for sm80) for block-wise 4b quantized float 16 GEMM. #18619
Azure Pipelines / Linux GPU CI Pipeline (Linux_Test Linux_Test)
succeeded
Feb 29, 2024 in 1h 0m 40s
Linux_Test Linux_Test succeeded
Loading