v2.0.0
MerlinKallenbornAA
released this
21 May 09:21
·
296 commits
to main
since this release
2.0.0
Breaking Changes
- Changed the behavior of
IncrementalEvaluator::do_evaluate
such that it now sends allSuccessfulExampleOutput
s todo_incremental_evaluate
instead of only the newSuccessfulExampleOutput
s.
New Features
- Add generic
EloEvaluationLogic
class for implementation of Elo evaluation use cases. - Add
EloQaEvaluationLogic
for Elo evaluation of QA runs, with optional later addition of more runs to an existing evaluation. - Add
EloAggregationAdapter
class to simplify using theComparisonEvaluationAggregationLogic
for different Elo use cases. - Add
elo_qa_eval
tutorial notebook describing the use of an (incremental) Elo evaluation use case for QA models. - Add
how_to_implement_elo_evaluations
how-to as skeleton for implementing Elo evaluation cases
Fixes
ExpandChunks
-task is now fast even for very large documents
Full Changelog: v1.2.0...v2.0.0