-
Notifications
You must be signed in to change notification settings - Fork 26
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
eval generates answer same as dataset #16
Comments
In |
Did you get the above results on llama2? I am also looking at the 2nd element of that list and seems to get the same as the ground truth. Can you share your eval stat with me if possible? |
I'm working with Phi-1.5. |
https://github.com/locuslab/tofu/blob/main/data/ft_epoch5_lr1e-05_llama2-7b_full_wd0.01/eval_results/ds_size300/eval_log_forget.json looks like the latest |
My results from before and after refactoring look similar corresponding to the generated answer though the aggregate_stat is slightly different! By eval_stat I mean the aggregate_stat where you get to see the scores like forget quality and model utility. |
I see. It would be good to understand what exactly the refactor changed. @zhilif? |
@shaswati1 can you run grad diff and check the generations? That will tell us if the issue is with the method or something else you are doing. Since grad diff definitely works for me (with Phi). Maybe you can try Phi instead of Llama as well. |
One thing we noticed is that llama2 results are not exactly reproducible when flash_attention is enabled. |
How many steps have you trained? Also is the p-value tested against the retain model? A small p-value means this model is very different from the retain model, which should be the case in your scenario? |
I finetuned llama2 on the full dataset, ran gradient ascent on forget05, and then evaluated the unlearned model on forget05. Surprisingly, when I looked at the eval_log_forget.json file all I could see was that it generates the responses as it is in the dataset. For example,
Question: What is the full name of the geology author born in Karachi, Pakistan on 06/30/1975?
Answer: The author's name is Hina Ameen.
Generated Answer: The author's name is Hina Ameen.
Also, the p-value is substantially low (7.82e-19)
Am I interpreting the evaluated results correctly?
The text was updated successfully, but these errors were encountered: