[CUDA] Add use_tf32 provider option (for FP32 GEMM) #19357
+245
−139
Merged
Azure Pipelines / Big Models (Llama2_ONNX_FP16 Llama2_ONNX_FP16)
succeeded
Feb 6, 2024 in 13m 12s
Llama2_ONNX_FP16 Llama2_ONNX_FP16 succeeded
Loading