Mlas Gemm 4bit avx2, avx512, and avx512vnni kernels #20163
Merged
Azure Pipelines / orttraining-ortmodule-distributed (DistributedInferenceTest Onnxruntime_Linux_GPU_Inference_Distributed_Test)
succeeded
Apr 26, 2024 in 28m 19s
DistributedInferenceTest Onnxruntime_Linux_GPU_Inference_Distributed_Test succeeded
Loading