This repository has been archived by the owner on Aug 7, 2024. It is now read-only.
[DISCUSSION] fix float8 all-gather in FSDP2 + TP: DTensor(WeightWithDynamicFloat8CastTensor) #326
Facebook GitHub Tools / Facebook CLA Check
succeeded
Aug 1, 2024 in 0s
Contributor License Agreement is valid!
Loading