About continuous_training #137

Wintoplay · 2024-11-16T20:55:03Z

With continuous_training on bunny VLM, do we still need to specify vision_tower path?

If we do point to siglip path, will it use that untrained weight instead of vision_tower that comes with bunny vlm I downloaded from huggingface?

What should I specify?

Isaachhh · 2024-11-17T10:29:53Z

Yes, it's still needed to specify --vision_tower to the path to huggingface/siglip-so400m-patch14-384.

But the vision tower of Bunny models would be used because "continuous_training": true in /path/to/merged_model/config.json.

Wintoplay · 2024-11-17T12:52:00Z

do it work fine with this model too?
https://huggingface.co/scb10x/llama-3-typhoon-v1.5-8b-vision-preview
since it is trained by Bunny method

Isaachhh · 2024-11-17T13:34:08Z

I think so.

But also check #130.

Provide feedback