You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hey guys! For who is interested, I recently submitted a pull request to implements SPPO on Axolotl trainer, you can fallow the pull request here: axolotl-ai-cloud/axolotl#1735
Hey guys! For who is interested, I recently submitted a pull request to implements SPPO on Axolotl trainer, you can fallow the pull request here:
axolotl-ai-cloud/axolotl#1735
Original SPPO implementation fork:
https://github.com/kaykyr/axolotl
See examples/llama3/sppo-qlora-8b.yml config file to see how train SPPO.
The text was updated successfully, but these errors were encountered: