Skip to content

reduce all-to-all communication volume when both expert and non-expert are tensor-parallel #511

reduce all-to-all communication volume when both expert and non-expert are tensor-parallel

reduce all-to-all communication volume when both expert and non-expert are tensor-parallel #511

Triggered via pull request July 22, 2024 19:40
@loadamsloadams
synchronize #5626
myfeature
Status Success
Total duration 2h 9m 8s
Artifacts

nv-ds-chat.yml

on: pull_request
Fit to window
Zoom out
Zoom in