We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Update publication
VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment
vineppo
Amirhossein Kazemnejad
Amirhossein Kazemnejad, Milad Aghajohari, Eva Portelance, Alessandro Sordoni, Siva Reddy, Aaron Courville, Nicolas Le Roux
ArXiv
https://arxiv.org/abs/2410.01679
2024
10
02
No response
The text was updated successfully, but these errors were encountered:
No branches or pull requests
Action
Update publication
Title
VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment
Shorthand
vineppo
Author
Amirhossein Kazemnejad
Names
Amirhossein Kazemnejad, Milad Aghajohari, Eva Portelance, Alessandro Sordoni, Siva Reddy, Aaron Courville, Nicolas Le Roux
Venue
ArXiv
Link
https://arxiv.org/abs/2410.01679
Year
2024
Month
10
Day
02
Tags
No response
Code
No response
Webpage
No response
Video
No response
Twitter
No response
Demo
No response
Thumbnail
No response
Abstract
No response
The text was updated successfully, but these errors were encountered: