Skip to content

Commit

Permalink
Fix TF32 feature registration
Browse files Browse the repository at this point in the history
  • Loading branch information
wingertge committed Nov 25, 2024
1 parent 7398dec commit 1474ce5
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion crates/cubecl-cpp/src/cuda/wmma/cuda_compiler.rs
Original file line number Diff line number Diff line change
Expand Up @@ -91,7 +91,7 @@ impl WmmaCompiler<CudaDialect<Self>> for CudaWmmaCompiler {
gpu::Elem::Float(gpu::FloatKind::TF32),
gpu::Elem::Float(gpu::FloatKind::TF32),
gpu::Elem::Float(gpu::FloatKind::F32),
vec![(16, 8, 16)],
vec![(16, 16, 8)],
));
}
result
Expand Down

0 comments on commit 1474ce5

Please sign in to comment.