-
Notifications
You must be signed in to change notification settings - Fork 274
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Attempting to recreate Regional Prompting in Omnigen using only words… #107
Comments
Hi, @adamreading, thanks for your attention to our work! I think it's difficult to express these regions' positions using text alone. These Regional Prompting can be viewed as a visual condition. You enable OmniGen with Regional Prompting via simple fine-tuning. You just need to input the mask as an image along with the text and fine-tune the model with this type of data. |
Thanks for responding - I have been testing all the versions that were created to work with Comfyu - so far even in an Ultra GPU cloud server - with 48Gb VRAM and 32Gb Ram -L40S - I can still only get to 32 Seconds Text to Image and 80-100 seconds Image to Image - without adding any extra masks etc. To be usable in the main environment I would want it for, the comparable flux workflows are running in 6-10 seconds. I’ll keep an eye on developments and I truly love the idea of what you have created - but it’s got to be faster somehow lol… |
Thank you for your feedback! We will continue to optimize the model. I believe that with technological advancements, unified image generation models like OmniGen will become faster. |
Because of Image limitations here I will have to spread this over a couple of posts but - everyone is going on about the new Flux Regional Prompting - I was saying - we don’t need this with omnigen - but actually I can’t work out how to do it…
This image is the examples given for Flux -
The text was updated successfully, but these errors were encountered: