Adding cuda kernel (optimized for sm80) for block-wise 4b quantized float 16 GEMM. #18619
Azure Pipelines / Linux CPU CI Pipeline
succeeded
Feb 29, 2024 in 57m 37s
Build #20240228.28 succeeded
Details
- Failed: 0 (0.00%)
- Passed: 32,263 (99.93%)
- Other: 22 (0.07%)
- Total: 32,285
Loading