[CUDA] Add use_tf32 provider option (for FP32 GEMM) #19357

Merged

tianleiwu merged 13 commits into main from tlwu/use_tf32

Feb 6, 2024

+245 −139

undo conv

Azure Pipelines / Big Models (Llama2_ONNX_FP16 Llama2_ONNX_FP16) succeeded Feb 6, 2024 in 13m 12s

0 errors / 0 warnings