Skip to content

v0.2.0

Compare
Choose a tag to compare
@garrett4wade garrett4wade released this 04 Jul 10:14
· 35 commits to main since this release
46d7dc3

What's Changed

  • Support more dense models: GPT-2, Gemma, Qwen2, Mistral.

  • Support fast generation with CUDAgraph.

  • Support distributed experiments with Ray.

  • Bug fixes with the Megatron training backend and the C++ extension.

Please check the updated documentation for details.

Full Changelog: https://github.com/openpsi-project/ReaLHF/commits/v0.2.0