v0.2.0
What's Changed
- Support more dense models: GPT-2, Gemma, Qwen2, Mistral.
- Support fast generation with CUDA graphs.
- Support distributed experiments with Ray.
- Fix bugs in the Megatron training backend and the C++ extension.
Please check the updated documentation for details.
Full Changelog: https://github.com/openpsi-project/ReaLHF/commits/v0.2.0