Adding cuda kernel (optimized for sm80) for block-wise 4b quantized float 16 GEMM. #18619
Azure Pipelines / ONNX Runtime Web CI Pipeline (Precheck_and_extract_commit Precheck_and_extract_commit)
succeeded
Feb 28, 2024 in 2m 12s
Precheck_and_extract_commit Precheck_and_extract_commit succeeded
Loading