added missing tf32 support
pemeliya authored and mmakevic-amd committed Nov 4, 2024
1 parent ad55450 commit 5fd2117
Showing 1 changed file with 2 additions and 1 deletion.
3 changes: 2 additions & 1 deletion tensorflow/compiler/xla/stream_executor/gpu/gpu_blas_lt.cc
@@ -96,7 +96,8 @@ xla::StatusOr<ComputationType> GetBlasComputationType(
     case DataType::kFloat:  // fall-through
       if (lhs_dtype == DataType::kHalf) return f16_comp;
       if (lhs_dtype == DataType::kBF16) return bf16_comp;
-      return ComputationType::kF32;
+      return TF32_Enabled() ? ComputationType::kTF32AsF32
+                            : ComputationType::kF32;
     case DataType::kComplexFloat:
       return ComputationType::kF32;
     case DataType::kDouble:  // fall-through
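
For readers without the full file at hand, the effect of the hunk above on the kFloat branch can be sketched in isolation. This is only an illustration under stated assumptions: the two enums are cut-down stand-ins rather than the real stream_executor definitions, FloatOutputComputeType is a hypothetical name, f16_comp and bf16_comp are passed as parameters because their definitions lie outside the hunk, and the tf32_enabled flag stands in for the TF32_Enabled() call, whose implementation is not shown in this commit.

```cpp
#include <cassert>

// Cut-down stand-ins for the real enums (assumption: only the members
// needed for this illustration are listed).
enum class DataType { kHalf, kBF16, kFloat };
enum class ComputationType { kF16, kF32, kTF32AsF32, kBF16AsF32 };

// Hypothetical helper mirroring the kFloat branch after this commit.
// f16_comp / bf16_comp are parameters because the hunk does not show how
// they are defined; tf32_enabled replaces the TF32_Enabled() call.
inline ComputationType FloatOutputComputeType(DataType lhs_dtype,
                                              ComputationType f16_comp,
                                              ComputationType bf16_comp,
                                              bool tf32_enabled) {
  if (lhs_dtype == DataType::kHalf) return f16_comp;
  if (lhs_dtype == DataType::kBF16) return bf16_comp;
  // Before the commit this line returned kF32 unconditionally.
  return tf32_enabled ? ComputationType::kTF32AsF32 : ComputationType::kF32;
}

// Small self-check of the three paths through the branch.
inline void RunExamples() {
  const ComputationType f16 = ComputationType::kF16;
  const ComputationType bf16 = ComputationType::kBF16AsF32;
  // Half and BF16 inputs are unaffected by the TF32 switch.
  assert(FloatOutputComputeType(DataType::kHalf, f16, bf16, true) == f16);
  assert(FloatOutputComputeType(DataType::kBF16, f16, bf16, true) == bf16);
  // Float inputs now depend on the switch.
  assert(FloatOutputComputeType(DataType::kFloat, f16, bf16, true) ==
         ComputationType::kTF32AsF32);
  assert(FloatOutputComputeType(DataType::kFloat, f16, bf16, false) ==
         ComputationType::kF32);
}
```

In this sketch only the final return differs from the pre-commit behaviour: half and BF16 inputs keep their existing computation types, and a float input yields kTF32AsF32 only when the switch is on.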