Skip to content

support bf16_optimizer moe expert parallel training and moe EP grad_scale/grad_norm fix #8909

support bf16_optimizer moe expert parallel training and moe EP grad_scale/grad_norm fix

support bf16_optimizer moe expert parallel training and moe EP grad_scale/grad_norm fix #8909

Annotations

1 warning

unit-tests

succeeded Mar 15, 2024 in 5m 26s