Why does the CSV extracted according to the steps show all the scores as nan? #27

Open
SsmallSong opened this issue Sep 19, 2024 · 2 comments

Comments

@SsmallSong

file name, index, score
cos, 0, nan
cos, 1, nan
cos, 2, nan
cos, 3, nan
cos, 4, nan
cos, 5, nan
cos, 6, nan
cos, 7, nan
cos, 8, nan
cos, 9, nan
cos, 10, nan
cos, 11, nan
cos, 12, nan
cos, 13, nan
cos, 14, nan
cos, 15, nan
cos, 16, nan
cos, 17, nan
cos, 18, nan
cos, 19, nan
cos, 20, nan

@xiamengzhou
Collaborator

I think this is most likely because the gradients coming out of the model are incorrect. Could you check whether the raw gradients, before compression, are correct?
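One way to run the check suggested above is to scan each raw gradient vector for NaN or infinite entries before it is compressed. The following is only a minimal sketch: it assumes the gradients are available as plain sequences of floats, and `find_nonfinite_rows` and `raw_grads` are hypothetical names, not part of this repository. With PyTorch tensors, `torch.isfinite` serves the same purpose.

```python
import math
from typing import Iterable, List, Sequence


def find_nonfinite_rows(grads: Iterable[Sequence[float]]) -> List[int]:
    """Return the indices of gradient vectors that contain NaN or +/-inf."""
    bad = []
    for i, row in enumerate(grads):
        if any(not math.isfinite(v) for v in row):
            bad.append(i)
    return bad


# Toy data standing in for raw (uncompressed) per-example gradients.
raw_grads = [
    [0.1, -0.2, 0.3],
    [float("nan"), 0.0, 1.0],
    [0.5, float("inf"), -0.5],
]

bad_rows = find_nonfinite_rows(raw_grads)
print(bad_rows)  # indices of the vectors with non-finite entries
```

If any index is reported, the NaN values are already present before compression, which points at the gradient computation rather than at the later scoring step.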

@totoers

totoers commented Nov 6, 2024

I also ran into this problem. The earliest place where NaN appears seems to be the execution of less/scripts/get_info/grad/get_eval_lora_grads.sh. I used tydiqa as the validation data, and some data points had a NaN loss. What should I do in this situation?
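A possible first step is to identify which validation examples produce the NaN loss and whether they have any supervised tokens at all. This is a hedged sketch, not a confirmed diagnosis for this issue: it assumes per-example losses and label sequences have already been collected, and that masked label positions use `-100` (the default `ignore_index` of PyTorch's cross-entropy loss). An example whose labels are all masked yields a mean over zero tokens, which is one common source of NaN. The names `diagnose_examples`, `losses`, and `labels` are hypothetical.

```python
import math
from typing import Dict, List, Sequence

IGNORE_INDEX = -100  # assumed masking value for unsupervised label positions


def diagnose_examples(
    losses: Sequence[float], labels: Sequence[Sequence[int]]
) -> List[Dict[str, object]]:
    """Flag examples with a non-finite loss or with no supervised tokens."""
    report = []
    for i, (loss, lab) in enumerate(zip(losses, labels)):
        n_supervised = sum(1 for t in lab if t != IGNORE_INDEX)
        finite = math.isfinite(loss)
        if not finite or n_supervised == 0:
            report.append(
                {"index": i, "loss_is_finite": finite, "supervised_tokens": n_supervised}
            )
    return report


# Toy data: the second example has every label position masked.
losses = [1.2, float("nan"), 0.8]
labels = [
    [-100, 5, 6],
    [-100, -100, -100],
    [7, 8, -100],
]

report = diagnose_examples(losses, labels)
for entry in report:
    print(entry)
```

If the flagged examples turn out to have zero supervised tokens, filtering them out of the validation set before computing gradients is one option to try. If they do have supervised tokens, the cause lies elsewhere, for example in numerical precision.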
