We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
0c3aa2c
First public release with basic distributed MoE functions, tested with Megatron-LM and Transformer-XL.