support bf16_optimizer moe expert parallel training and moe EP grad_scale/grad_norm fix #8909
Annotations
1 warning
Node.js 16 actions are deprecated. Please update the following actions to use Node.js 20: actions/checkout@v3. For more information see: https://github.blog/changelog/2023-09-22-github-actions-transitioning-from-node-16-to-node-20/.
|
Loading