loss curve of SFT on vicuna-7b #9

Open
Xiaohui9607 opened this issue Jun 19, 2024 · 6 comments

Comments

@Xiaohui9607

Hi, I got a training curve like this. Is it normal? Would you mind sharing your trainer_state.json? Thanks!
[image: training loss curve]
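For comparing curves without a screenshot, the loss values can be read directly from trainer_state.json. The following is a minimal sketch, assuming the file was written by the Hugging Face `Trainer`, which stores its logs under a `log_history` list where training-step entries carry `step` and `loss` keys. The function names and the example path are hypothetical.

```python
import json
from pathlib import Path


def extract_loss_curve(state: dict) -> list[tuple[int, float]]:
    """Return (step, loss) pairs from a parsed trainer_state.json."""
    curve = []
    for entry in state.get("log_history", []):
        # Training-step logs carry a "loss" key; evaluation entries
        # ("eval_loss") and the final summary ("train_loss") do not.
        if "loss" in entry and "step" in entry:
            curve.append((int(entry["step"]), float(entry["loss"])))
    return curve


def load_loss_curve(path: str) -> list[tuple[int, float]]:
    """Read trainer_state.json from disk and extract its loss curve."""
    state = json.loads(Path(path).read_text())
    return extract_loss_curve(state)
```

Calling `load_loss_curve("path/to/trainer_state.json")` (path is a placeholder) on two runs gives two lists that can be plotted or compared step by step.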

@xiaoachen98
Owner

Yes, it's quite normal. What about the benchmark performance?

@Xiaohui9607
Author

Yeah, I think the results reproduce on all the (VQA) benchmarks. The only difference is in image captioning, where my model tends to generate captions with less detail.
For example, given the same image:

Your uploaded weights:
The image shows a black bear in its natural habitat, which appears to be a forested area. The bear is standing on all fours, with its head lowered towards the ground, possibly sniffing or foraging for food. The bear's fur is predominantly black, which is characteristic of the species, and it has a distinctive white patch on its chest. The bear's posture and the environment suggest that it is engaged in typical bear behavior, such as searching for food or exploring its surroundings. There are no visible signs of human interaction or disturbance in the image, indicating that the bear is in a relatively undisturbed natural setting.

My own weights:
The image shows a black bear walking through a wooded area.

I am not sure what's causing this.
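One way to put a number on "less detail" is to compare caption lengths across the two checkpoints for the same images. The sketch below is only an illustration with hypothetical helper names; word count is a crude proxy for detail, not a captioning metric.

```python
def word_count(caption: str) -> int:
    """Count whitespace-separated tokens in a caption."""
    return len(caption.split())


def length_ratio(candidate: str, reference: str) -> float:
    """Ratio of candidate caption length to reference caption length.

    Values well below 1.0 mean the candidate caption is much shorter
    than the reference caption for the same image.
    """
    ref_words = word_count(reference)
    if ref_words == 0:
        raise ValueError("reference caption is empty")
    return word_count(candidate) / ref_words
```

Averaging `length_ratio(own_caption, released_caption)` over a set of images would show whether the shorter outputs are systematic or specific to a few samples.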

@xiaoachen98
Owner

Are the prompts the same? That's so weird.

@Xiaohui9607
Author

I am using the "v1" prompt for both training and inference. Did you use the same?

@xiaoachen98
Owner

Yeah. I set the conv mode to "v1" for vicuna-7b too.

@hkunzhe

hkunzhe commented Jun 27, 2024

Does the training loss converge to, or fluctuate around, a value below 1?
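A question like this can be checked numerically rather than by eye. The following is a minimal sketch, assuming the per-step losses are already available as a list of floats. The helper names, the window size, and the tail fraction are illustrative choices, not values from this thread.

```python
def moving_average(values: list[float], window: int) -> list[float]:
    """Trailing moving average; early points average over fewer samples."""
    if window <= 0:
        raise ValueError("window must be positive")
    smoothed = []
    running = 0.0
    for i, value in enumerate(values):
        running += value
        if i >= window:
            # Drop the sample that just left the window.
            running -= values[i - window]
        smoothed.append(running / min(i + 1, window))
    return smoothed


def tail_stays_below(
    values: list[float],
    threshold: float = 1.0,
    window: int = 50,
    tail_fraction: float = 0.2,
) -> bool:
    """True if the smoothed loss stays under `threshold` over the final
    `tail_fraction` of training."""
    if not values:
        return False
    smoothed = moving_average(values, window)
    tail_len = max(1, int(len(smoothed) * tail_fraction))
    return all(v < threshold for v in smoothed[-tail_len:])
```

Smoothing first keeps a few noisy spikes above the threshold from dominating the answer, which matters because per-step SFT loss usually fluctuates from batch to batch.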
