(2023 Oct 26) Hi all, the feature updating of Fooocus will (really, really, this time) be paused for about two or three weeks because we really have some other workloads. Thanks for the passion of you all (and we in fact have kept updating even after last pausing announcement a week ago, because of many great feedbacks) - see you soon and we will come back in mid November. However, you may still see updates if other collaborators are fixing bugs or solving problems.
- New UI: now you can get each image during generating.
- Improved GPT2 by removing some tokens that may corrupt styles.
Style Updates:
- "Default (Slightly Cinematic)" as renamed to "Fooocus Cinematic".
- "Default (Slightly Cinematic)" is canceled from default style selections.
- Added "Fooocus Sharp". This style combines many CivitAI prompts that reduces SDXL blurry and improves sharpness in a relatively natural way.
- Added "Fooocus Enhance". This style mainly use the very popular default negative prompts from JuggernautXL and some other enhancing words. JuggernautXL's negative prompt has been proved to be very effective in many recent image posts on CivitAI to improve JuggernautXL and many other models.
- "Fooocus Sharp" and "Fooocus Enhance" and "Fooocus V2" becomes the new default set of styles.
- Removed the default text in the "negative prompt" input area since it is not necessary now.
- You can reproduce previous results by using "Fooocus Cinematic".
- "Fooocus Sharp" and "Fooocus Enhance" may undergo minor revision in future updates.
- Added support for authentication in --share mode (via auth.json).
- Allowed customizing resolutions in config.
Modifying this will make results worse if you do not understand how Positional Encoding works.
You have been warned.
If you do not know why numbers must be transformed with many Sin and Cos functions (yes, those Trigonometric functions that you learn in junior high school) before they are fed to SDXL, we do not encourage you to change this - you will become a victim of Positional Encoding. You are likely to suffer from an easy-to-fail tool, rather than getting more control.
Your knowledge gained from SD1.5 (for example, resolution numbers divided by 8 or 64 are good enough for UNet) does not work in SDXL. The SDXL uses Positional Encoding. The SD1.5 does not use Positional Encoding. They are completely different.
Your knowledge gained from other resources (for example, resolutions around 1024 are good enough for SDXL) is wrong. The SDXL uses Positional Encoding. People who say "all resolutions around 1024 are good" do not understand what is Positional Encoding. They are not intentionally misleading. They are just not aware of the fact that SDXL is using Positional Encoding.
The number 1152 must be exactly 1152, not 1152-1, not 1152+1, not 1152-8, not 1152+8. The number 1152 must be exactly 1152. Just Google what is a Positional Encoding.
Again, if you do not understand how Positional Encoding works, just do not change the resolution numbers.
- Fixed many problems related to torch autocast.
- Increased allowed random seed range.
- Fixed some potential numerical problems since 2.1.723
- Improve Fooocus Anime a bit by using better SD1.5 refining formulation.
- Now it is possible to translate 100% all texts in the UI.
- Added language/en.json to make translation easier.
- Added Canvas Zoom to inpaint canvas
- Fixed the problem that image will be cropped in UI when the uploaded image is too wide.
- I18N
- Corrected handling dash in wildcard names, more wildcards (extended-color).
- Corrected displaying multi-line prompts in Private Log.
- Added support for nested wildcards, more wildcards (flower, color_flower).
- Fixed resolution problems.
- Cleaned up Private Log (most users won't need information about raw prompts).
- Added more information about prompts in Private Log.
- Made wildcards in negative prompt use different seed.
- Added information about wildcards usage in console log.
- Allowed changing default values of advanced checkbox and image number.
- Updated Gradio to v3.41.2.
- Fixed many previous problems related to inpaint.
- Corrected reading empty negative prompt from config (it shouldn't turn into None).
- Updated FreeU node to v2 (gives less overcooked results).
- Disabled smart memory management (solves some memory issues).
- Added support for loading model files from subfolders.
- Improved wildcards implementation (using same wildcard multiple times will now return different values).
(2023 Oct 18) Again, the feature updating of Fooocus will be paused for about two or three weeks because we have some other workloads - we will come back in early or mid November. However, you may still see updates if other collaborators are fixing bugs or solving problems.
- Reduced 3.4GB RAM use when swapping base model.
- Reduced 372MB VRAM use in VAE decoding after using control model in image prompt.
- Note that Official ComfyUI (d44a2de) will run out of VRAM when using sdxl and control-lora on 2060 6GB that does not support float16 at resolution 1024. Fooocus 2.1.695 succeeded in outputting images without OOM using exactly same devices.
(2023 Oct 17) Announcement of update being paused.
- Putting custom styles before pre-defined styles.
- Avoided the consusion between Fooocus Anime preset and Fooocus Anime style (Fooocus Anime style is renamed to Fooocus Masterpiece because it does not make images Anime-looking if not using with Fooocus Anime preset).
- Fixed some minor bugs in Fooocus Anime preset's prompt emphasizing of commas.
- Supported and documented embedding grammar (and wildcards grammar).
- This release is a relative stable version and many features are determined now.
- Added support for wildcards (using files from wildcards folder - try prompts like
__color__ sports car
with different seeds).
- Added support for custom styles (loaded from JSON files placed in sdxl_styles folder).
- Added support for generate hotkey (CTRL+ENTER).
- Added support for generate forever (RMB on Generate button).
- Added support for playing sound when generation is finished ('notification.ogg' or 'notification.mp3').
- Preset system. Added anime and realistic support.
- removed pygit2 dependency (expect auto update) so that people will never have permission denied problems.
- Begin to support sd1.5 as refiner. This method scale sigmas given SD15/Xl latent scale and is probably the most correct way to do it. I am going to write a discussion soon.
AMD support on Linux and Windows.
- Image Prompt
- Finished the "Moving from Midjourney" Table
- Speed Up Again
- Improved the scheduling of ADM guidance and CFG mimicking for better visual quality in high frequency domain and small objects.
- Rework many patches and some UI details.
- Speed up processing.
- Move Colab to independent branch.
- Implemented CFG Scale and TSNR correction when CFG is bigger than 10.
- Implemented Developer Mode with more options to debug.
(2023 sep 21) The feature updating of Fooocus will be paused for about two or three weeks because we have some events and travelling - we will come back in early or mid October.
- Allow users to choose path of models.
- Inpaint model released.
- Variation/Upscale (Midjourney Toolbar) implemented.
- Virtual memory system implemented. Now Colab can run both base model and refiner model with 7.8GB RAM + 5.3GB VRAM, and it never crashes.
- If you are lucky enough to read this line, keep in mind that ComfyUI cannot do this. This is very reasonable that Fooocus is more optimized because it only need to handle a fixed pipeline, but ComfyUI need to consider arbitrary pipelines.
- But if we just consider the optimization of this fixed workload, after 2.0.16, Fooocus has become the most optimized SDXL app, outperforming ComfyUI.
- V2 released.
- completely rewrite text processing pipeline (higher image quality and prompt understanding).
- support multi-style.
- In 100 tests (prompts written by ChatGPT), V2 default results outperform V1 default results in 87 cases, evaluated by two human.
- In 100 tests (prompts written by ChatGPT), V2 prompt understanding outperform V1 prompt understanding in 81 cases, evaluated by two human, in both default setting and multi/single style mode.
- Because the above number is above 80%, we view this as a major update and directly jump to 2.0.0.
- Some other things are renamed.
- Use dynamic weighting and lower weights for prompt expansion.
- Fixed a small OOM problem.
- Change prompt expansion to suffix mode for better balance of semantic and style (and debugging).
- Tune the balance between style and Prompt Expansion.
- Begin to use magic split.
- Minor changes of Prompt Expansion.
- Reduce the semantic corruption of Prompt Expansion.
- Speed up Prompt Expansion a bit.
- Prompt expansion and a "Raw mode" to turn it off (similar to Midjourney's "raw").
- Reworked SAG, removed unnecessary patch
- Reworked anisotropic filters for faster compute.
- Replaced with guided anisotropic filter for less distortion.
(The update of Fooocus will be paused for a period of time for AUTOMATIC1111 sd-webui 1.6.X, and some features will also be implemented as webui extensions)
- Behaviors reverted to 1.0.36 again (refiner steps). The 1.0.36 is too perfect and too typical; beating 1.0.36 is just impossible.
- Reverted unstable changes between 1.0.37 and 1.0.38 .
- Increased refiner steps to half of sampling steps.
- Change gaussian kernel to anisotropic kernel.
- Random seed restoring.
- Hide items in log when images are removed.
- Fooocus private log
- Fix typo and UI.
- Added "Advanced->Advanced->Advanced" block for future development.
- Fix overcook problem in 1.0.28
- SAG implemented
- Fix small problem in textbox css
- support sys.argv --listen --share --port
- Taller input textbox.
- Added some hints on linux after UI start so users know the App does not fail.
- Support linux.
- Speed-up text encoder.
- Re-write UI to use async codes: (1) for faster start, and (2) for better live preview.
- Removed opencv dependency
- Plan to support Linux soon
- Unlock to allow changing model.
- Change default model to SDXL-1.0-vae-0.9. (This means the models will be downloaded again, but we should do it as early as possible so that all new users only need to download once. Really sorry for day-0 users. But frankly this is not too late considering that the project is just publicly available in less than 24 hours - if it has been a week then we will prefer more lightweight tricks to update.)
- Implemented "Fooocus/outputs" folder for saving user results.
- Ignored cv2 errors when preview fails.
- Mentioned future AMD support in Readme.
- Created this log.
Publicly available.
Initial Version.