This repository has been archived by the owner on Aug 7, 2024. It is now read-only.
Checkpoint to reduce fp8_weight tensor saved for backwards#193
Open
drisspg wants to merge 1 commit intomainfrom checkpointing_weight_cast
+31-3