v4.1.0
MerlinKallenbornAA released this 17 Jun 12:13
4.1.0
New Features
- Add `eot_token` property to `ControlModel` and derived classes (`LuminousControlModel`, `Llama2InstructModel` and `Llama3InstructModel`) and let `PromptBasedClassify` use this property instead of a hardcoded string.
- Introduce a new argilla client `ArgillaWrapperClient`. This uses the `argilla` package as a connection to argilla and supports all question types that argilla supports in their `FeedbackDataset`. This includes text and yes/no questions. For more information about the questions, check their official documentation.
  - Changes to switch:
    - `DefaultArgillaClient` -> `ArgillaWrapperClient`
    - `Question` -> `argilla.RatingQuestion`, `options` -> `values` and it takes only a list
    - `Field` -> `argilla.TextField`
- Add `description` parameter to `Aggregator.aggregate_evaluation` to allow individual descriptions without the need to create a new `Aggregator`. This was missing from the previous release.
- Add optional field `metadata` to `Dataset`, `RunOverview`, `EvaluationOverview` and `AggregationOverview`
  - Update `parameter_optimization.ipynb` to demonstrate usage of metadata
- Add optional field `label` to `Dataset`, `RunOverview`, `EvaluationOverview` and `AggregationOverview`
- Add `unwrap_metadata` flag to `aggregation_overviews_to_pandas` to enable inclusion of metadata in pandas export. Defaults to `True`.
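
The `unwrap_metadata` flag above decides whether metadata ends up in the exported table. As a rough illustration of that idea only, the sketch below flattens an optional `metadata` dict into top-level columns using plain Python dicts. The helper name `unwrap_metadata_rows`, the row layout and the sample values are hypothetical. This is an assumption about what "unwrapping" could mean, not the library's implementation, and it does not call pandas or `aggregation_overviews_to_pandas`.

```python
from typing import Any


def unwrap_metadata_rows(
    rows: list[dict[str, Any]], unwrap_metadata: bool = True
) -> list[dict[str, Any]]:
    """Hypothetical helper: turn each row's `metadata` dict into top-level keys.

    With unwrap_metadata=False the metadata is left out of the result,
    mirroring a flag that "enables inclusion" of metadata in an export.
    """
    flat_rows = []
    for row in rows:
        # Copy every field except the nested metadata container.
        flat = {key: value for key, value in row.items() if key != "metadata"}
        if unwrap_metadata:
            # Metadata is optional, so it may be missing or None.
            # Note: a metadata key that equals an existing field name overwrites it.
            flat.update(row.get("metadata") or {})
        flat_rows.append(flat)
    return flat_rows


# Sample rows with made-up values; `label` and `metadata` are both optional.
example_rows = [
    {"id": "agg-1", "label": "baseline", "metadata": {"model": "m1", "prompt": "p1"}},
    {"id": "agg-2", "label": None, "metadata": None},
]

for flat_row in unwrap_metadata_rows(example_rows):
    print(flat_row)
```

Flattening keeps each metadata key available as its own column for filtering or grouping, at the cost of possible name clashes with existing fields.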
Fixes
- Reinitializing different `AlephAlphaModel` instances and retrieving their tokenizer should now consume a lot less memory.
- Evaluations now raise errors if ids of examples and outputs no longer match. If this happens, continuing the evaluation would only produce incorrect results.
- Performing evaluations on runs with a different number of outputs now raises errors. Continuing the evaluation in this case would only lead to an inconsistent state.
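
The last two fixes both describe failing early instead of evaluating inconsistent data. A minimal sketch of that kind of guard, assuming example and output ids are plain strings, might look like the following. The names `MismatchedOutputsError` and `validate_runs` are hypothetical and are not taken from the library.

```python
class MismatchedOutputsError(ValueError):
    """Hypothetical error type used only in this sketch."""


def validate_runs(example_ids: list[str], output_ids_per_run: list[list[str]]) -> None:
    """Raise if runs disagree in size or if output ids do not match example ids."""
    # Check 1: every run must contain the same number of outputs.
    counts = [len(output_ids) for output_ids in output_ids_per_run]
    if len(set(counts)) > 1:
        raise MismatchedOutputsError(
            f"Runs have a different number of outputs: {counts}"
        )

    # Check 2: each run's output ids must be exactly the example ids (order ignored).
    expected = sorted(example_ids)
    for run_index, output_ids in enumerate(output_ids_per_run):
        if sorted(output_ids) != expected:
            raise MismatchedOutputsError(
                f"Output ids of run {run_index} do not match the example ids"
            )


# Consistent data passes silently.
validate_runs(["ex-1", "ex-2"], [["ex-2", "ex-1"], ["ex-1", "ex-2"]])

# A run with a missing output is rejected before any evaluation happens.
try:
    validate_runs(["ex-1", "ex-2"], [["ex-1", "ex-2"], ["ex-1"]])
except MismatchedOutputsError as error:
    print(f"Rejected: {error}")
```

Raising before the evaluation starts keeps a partial or misaligned run from being recorded as if it were a valid result.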
Full Changelog: v4.0.1...v4.1.0