Question about training and (possibly) finetuning the model #6

Open
costrice opened this issue Dec 18, 2024 · 5 comments

Comments

@costrice

Hi, thanks for your impressive work on image generation!

I anticipate using it as the generative backbone for my future work, so I am curious how many GPU resources are needed to train the model from scratch. More importantly, is it possible to fine-tune the model using fewer GPU resources, like we commonly do with diffusion-based models such as SD? Could you please provide some information? Thanks very much!

@JeyesHan
Collaborator

JeyesHan commented Dec 18, 2024

Training from scratch is very expensive, especially at 1024x1024 resolution. We highly recommend you fine-tune Infinity. In our full-parameter fine-tuning test with 4 GPUs, one iteration takes around 6 s and 50 GB of VRAM per GPU, with a global batch size of 16 at 1024x1024 resolution. You can estimate the GPU resources for your fine-tuning task from these numbers.
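
For a quick back-of-envelope estimate from those numbers, here is a minimal sketch (plain Python; `dataset_size` and `num_epochs` are hypothetical placeholders, not values from the setup above):

```python
# Estimate fine-tuning cost from the measured numbers above:
# 4 GPUs, ~6 s/iteration, ~50 GB VRAM per GPU, global batch size 16 at 1024x1024.

sec_per_iter = 6          # measured: seconds per iteration (4 GPUs)
global_batch_size = 16    # measured: global batch size
vram_per_gpu_gb = 50      # measured: peak VRAM per GPU

dataset_size = 100_000    # placeholder: number of fine-tuning images
num_epochs = 3            # placeholder: passes over the dataset

iterations = dataset_size * num_epochs // global_batch_size
wall_clock_hours = iterations * sec_per_iter / 3600

print(f"~{iterations} iterations, about {wall_clock_hours:.1f} hours on 4 GPUs")
print(f"each GPU needs ~{vram_per_gpu_gb} GB of VRAM")
```

With these placeholder values, that works out to roughly 18,750 iterations, or about 31 hours on 4 GPUs.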

@wxxhaoshuai

Hi, I would like to know how many computing resources are required to train the 125M model from scratch, and how many are required to fine-tune it?

@JeyesHan
Collaborator

JeyesHan commented Dec 26, 2024

@wxxhaoshuai
Apart from model size, the computing resources also depend on the dataset size and the target resolution. I think 16 GPUs (A100 or H100) are enough to train the 125M model from scratch at 256x256 resolution.
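
As a rough sanity check on why a 125M model fits comfortably, a sketch using textbook per-parameter memory costs for mixed-precision Adam training (these are generic assumptions, not numbers from the Infinity codebase):

```python
# Per-parameter memory for mixed-precision training with Adam:
#   2 B fp16 weights + 2 B fp16 gradients
#   4 B fp32 master weights + 8 B fp32 Adam moments (m, v)
params = 125e6                      # 125M-parameter model
bytes_per_param = 2 + 2 + 4 + 8     # 16 bytes per parameter
model_state_gb = params * bytes_per_param / 1024**3
print(f"weights + grads + optimizer state: ~{model_state_gb:.1f} GB")  # ~1.9 GB
```

So the model and optimizer state are tiny; at 256x256 the activation memory and batch size dominate, and 16 A100/H100 GPUs likely buy training throughput rather than capacity.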

@wxxhaoshuai

Will you release the smaller checkpoints, such as 125M or 1B?

@JeyesHan
Collaborator

JeyesHan commented Dec 26, 2024

@wxxhaoshuai These small models were trained on a small subset of the whole dataset and are used to demonstrate the scaling capability of Infinity. They were not fully trained with abundant data, resolutions, and iterations. Therefore, we have no plan to release the smaller models for Infinity 😭. We plan to release Infinity-20B.
