distilabel-helm-instruct-adaptable-evaluation-criteria

A repo that implements Stanford CRFM their HELM Instruct with adaptable evaluation criteria

Install

python -m venv .venv
source .venv/bin/activate
pip install -r helm_instruct/requirements.txt

Run

python helm_instruct/main.py

Create custom evaluation criteria

from helm_instruct.evaluation_criteria import Rating,Criterion

criterion = {
    "childfriendliness"; Criterion(
        question="How child-friendly is the game?",
        ratings=[
            Rating(
                rating=1,
                description="Not child-friendly"
            ),
            Rating(
                rating=2,
                description="A bit child-friendly"
            ),
            Rating(
                rating=3,
                description="Child-friendly"
            ),
            Rating(
                rating=4,
                description="Very child-friendly"
            )
        ]
    )
}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

distilabel-helm-instruct-adaptable-evaluation-criteria

Install

Run

Create custom evaluation criteria

Files

README.md

Latest commit

History

README.md

File metadata and controls

distilabel-helm-instruct-adaptable-evaluation-criteria

Install

Run

Create custom evaluation criteria