Skip to content

Connecting fp16xq4 gemm kernels (optimized for A100) to MatMulNBits<fp16> operator #28942

Connecting fp16xq4 gemm kernels (optimized for A100) to MatMulNBits<fp16> operator

Connecting fp16xq4 gemm kernels (optimized for A100) to MatMulNBits<fp16> operator #28942

Triggered via pull request July 2, 2024 21:04
Status Cancelled
Total duration 2h 0m 27s
Artifacts

windows.yml

on: pull_request
Windows-CUDA-12
0s
Windows-CUDA-12
Fit to window
Zoom out
Zoom in

Annotations

1 error
Windows-CUDA-12
Canceling since a higher priority waiting request for 'Windows_CI-cfu_transform_prepack' exists