Adding cuda kernel (optimized for sm80) for block-wise 4b quantized float 16 GEMM. #18619
Azure Pipelines / Windows CPU CI Pipeline
succeeded
Feb 29, 2024 in 1h 9m 48s
Build #20240228.29 succeeded
Details
- Failed: 0 (0.00%)
- Passed: 58,787 (98.25%)
- Other: 1,050 (1.75%)
- Total: 59,837