Adding cuda kernel (optimized for sm80) for block-wise 4b quantized float 16 GEMM. #23290

Triggered via pull request: January 26, 2024 18:08
Status: Success
Total duration: 1h 48m 23s

windows.yml

on: pull_request