Skip to content

reduce all-to-all communication volume when both expert and non-expert are tensor-parallel #608

reduce all-to-all communication volume when both expert and non-expert are tensor-parallel

reduce all-to-all communication volume when both expert and non-expert are tensor-parallel #608

Triggered via pull request July 20, 2024 23:16
Status Success
Total duration 48m 35s
Artifacts

hpu-gaudi2.yml

on: pull_request
Fit to window
Zoom out
Zoom in