[WIP] Update hybrid model parallel GPT2 example for up to 175B parameters #351
