Release v0.5.4
What's Changed
- Add Qwen1.5-72B-Chat to AlpacaEval by @Lukeming-tsinghua in #226
- Add claude-instant-1.2, deepseek-llm-67b-chat, wizardlm-70b, Qwen-14B-Chat (config + outputs without annotations) by @gblazex in #228
- [DATA] Adding annotations for the arena models by @YannDubs in #229
- Update README.md - Add missing "Y" to "ou" by @yoderj in #230
- [DEV] Analyzing length-controlled metrics by @YannDubs in #231
- [DOC] add annotation interpretation by @YannDubs in #232
- [DATA] add results from the Arena openai models by @YannDubs in #234
- Update Elo for llama-2-13b-chat-hf by @gblazex in #235
- [NOTEBOOK] add length-corrected GLM by @YannDubs in #237
- [ENH] add inverse mapper to make sure in and out types are the same by @YannDubs in #240
- [ENH] update to allow AF (AlpacaFarm) to use AE (AlpacaEval) by @YannDubs in #241
New Contributors
- @Lukeming-tsinghua made their first contribution in #226
- @yoderj made their first contribution in #230
Full Changelog: v0.5.3...v0.5.4