Change the repository type filter
All
Repositories list
10 repositories
- An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
- Code and documentation to train Stanford's Alpaca models, and generate the data.
- A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.