-
Notifications
You must be signed in to change notification settings - Fork 641
Quick Start
afiaka87 edited this page Apr 22, 2021
·
8 revisions
https://github.com/lucidrains/DALLE-pytorch/wiki/Multi-GPU-and-Multi-Node
https://github.com/lucidrains/DALLE-pytorch/wiki/Vast.ai-Sparse-Attention
https://github.com/lucidrains/DALLE-pytorch/wiki/Attention-Layers
Brand new from microsoft - if you can manage to install it, you'll get support for both parameter and optimizer CPU offloading, as well as a host of other features labelled under the category "ZeRO Stage 3".
Installation is tough to say the least.
Train with Deepspeed ZeRO Infinity