Question about the results #1

Open
peterwan1 opened this issue Nov 15, 2024 · 1 comment

Comments

@peterwan1

Dear author,
First, thanks for this great work.
While reading the paper, I noticed that the baseline results reported in Favicomp differ substantially from those reported in the original papers, such as COMPACT. Figure 4 of the COMPACT paper shows the results of COMPACT and raw with Contriever as the retriever, llama3-8b as the reader, and topk=5, and those results are very different from the ones in Favicomp. Is there a difference in the experimental settings that I've missed? Do you know what causes the discrepancy?

@JungDongwon
Collaborator

Hi @peterwan1,

Thanks for your interest in our paper! In Figure 4 of the COMPACT paper, the reader used for the experiment is GPT-3.5-Turbo, which is a much stronger reader than the one we used (Llama3-8B-Instruct). Let me know if you have more questions.
