Skip to content

Connecting fp16xq4 gemm kernels (optimized for A100) to MatMulNBits<fp16> operator #6339

Connecting fp16xq4 gemm kernels (optimized for A100) to MatMulNBits<fp16> operator

Connecting fp16xq4 gemm kernels (optimized for A100) to MatMulNBits<fp16> operator #6339

Triggered via pull request July 2, 2024 21:04
Status Cancelled
Total duration 2h 0m 26s
Artifacts

sca.yml

on: pull_request
Onnxruntime-SCA-training-CUDA
0s
Onnxruntime-SCA-training-CUDA
Onnxruntime-SCA-win32-WINML-x64
0s
Onnxruntime-SCA-win32-WINML-x64
Onnxruntime-SCA-win32-WINML-x86
0s
Onnxruntime-SCA-win32-WINML-x86
Fit to window
Zoom out
Zoom in

Annotations

3 errors
Onnxruntime-SCA-win32-WINML-x86
Canceling since a higher priority waiting request for 'Windows_SCA-cfu_transform_prepack' exists
Onnxruntime-SCA-training-CUDA
Canceling since a higher priority waiting request for 'Windows_SCA-cfu_transform_prepack' exists
Onnxruntime-SCA-win32-WINML-x64
Canceling since a higher priority waiting request for 'Windows_SCA-cfu_transform_prepack' exists