Commit
Merge pull request #69 from llauraa23/main
Fix parameters being updated to NaN during reward model training.