Skip to content

v2.0.0

Compare
Choose a tag to compare
@MerlinKallenbornAA MerlinKallenbornAA released this 21 May 09:21
· 296 commits to main since this release

2.0.0

Breaking Changes

  • Changed the behavior of IncrementalEvaluator::do_evaluate such that it now sends all SuccessfulExampleOutputs to do_incremental_evaluate instead of only the new SuccessfulExampleOutputs.

New Features

  • Add generic EloEvaluationLogic class for implementation of Elo evaluation use cases.
  • Add EloQaEvaluationLogic for Elo evaluation of QA runs, with optional later addition of more runs to an existing evaluation.
  • Add EloAggregationAdapter class to simplify using the ComparisonEvaluationAggregationLogic for different Elo use cases.
  • Add elo_qa_eval tutorial notebook describing the use of an (incremental) Elo evaluation use case for QA models.
  • Add how_to_implement_elo_evaluations how-to as skeleton for implementing Elo evaluation cases

Fixes

  • ExpandChunks-task is now fast even for very large documents

Full Changelog: v1.2.0...v2.0.0