
Adding cuda kernel (optimized for sm80) for block-wise 4b quantized float 16 GEMM. #32189


Triggered via pull request February 28, 2024 21:01
Status Success
Total duration 33s
Validation
22s