About continuous_training #137

Open
Wintoplay opened this issue Nov 16, 2024 · 3 comments

Comments

@Wintoplay

When using continuous_training with a Bunny VLM, do we still need to specify the vision_tower path?

If we point it at the SigLIP path, will training use those original SigLIP weights instead of the vision tower that ships with the Bunny VLM I downloaded from Hugging Face?

What should I specify?

@Isaachhh
Collaborator

Yes, you still need to set --vision_tower to the path to huggingface/siglip-so400m-patch14-384.

However, the vision tower from the Bunny model will be used, because "continuous_training": true is set in /path/to/merged_model/config.json.

https://github.com/BAAI-DCAI/Bunny?tab=readme-ov-file#continuous--fine-tuning
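Before launching training, it can help to confirm that the flag mentioned above is actually present in the merged model's config. The following is a minimal sketch, not part of the Bunny codebase: the helper name `uses_continuous_training` is hypothetical, and treating a missing key as false is an assumption of this example rather than documented Bunny behavior.

```python
import json
from pathlib import Path


def uses_continuous_training(model_dir: str) -> bool:
    """Return True if config.json in model_dir sets "continuous_training" to true.

    Hypothetical helper for illustration; it only reads the JSON file and
    does not call any Bunny or transformers API.
    """
    config_path = Path(model_dir) / "config.json"
    with config_path.open(encoding="utf-8") as f:
        config = json.load(f)
    # Assumption of this sketch: a missing key counts as "not set".
    return config.get("continuous_training", False) is True


if __name__ == "__main__":
    # Placeholder path taken from the comment above; replace with a real directory.
    model_dir = "/path/to/merged_model"
    if (Path(model_dir) / "config.json").exists():
        print(uses_continuous_training(model_dir))
```

If this prints `False` for a merged Bunny model, the config may need `"continuous_training": true` added, as described in the README section linked above.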

@Wintoplay
Author

Does this also work with the following model, since it was trained with the Bunny method?
https://huggingface.co/scb10x/llama-3-typhoon-v1.5-8b-vision-preview

@Isaachhh
Collaborator

I think so.

But also check #130.
