Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Difference between Answer Relevance and Answer Faithfulness #72

Open
WJ44 opened this issue Sep 24, 2024 · 1 comment
Open

Difference between Answer Relevance and Answer Faithfulness #72

WJ44 opened this issue Sep 24, 2024 · 1 comment

Comments

@WJ44
Copy link
Contributor

WJ44 commented Sep 24, 2024

Hello,

I was wondering whether there is any difference between Answer Relevance and Answer Faithfulness. Conceptually there is of course, but the code for training LLM judges and actually judging samples seems exactly the same. Would it not make sense for the answer relevance metric to not take into account the context during training and judging?

@majdsalman
Copy link

majdsalman commented Oct 1, 2024

I second this, from the paper:
context relevance (is the retrieved information pertinent to the test question)
answer faithfulness (is the response generated by the language model properly grounded in the retrieved context)
answer relevance (is the response also relevant to the question).

Context relevance is clear and is handled in synthetic data generation and model training/testing. Answer faithfulness and relevance are defined in the paper, but in the code treats them exactly the same. Except in some places where answer faithfulness is the only one appearing. I'm wondering if the idea was to implement this but you ended up not implementing answer faithfulness. In the paper no table of result, charts or examples include answer faithfulness.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants