Connecting fp16xq4 gemm kernels (optimized for A100) to MatMulNBits<fp16> operator #21083
Azure Pipelines / Linux GPU TensorRT CI Pipeline
Succeeded on Jul 10, 2024 in 1h 34m 8s
Build #20240710.24 succeeded
Details
- Failed: 0 (0.00%)
- Passed: 4,833 (99.90%)
- Other: 5 (0.10%)
- Total: 4,838