Merge-Stable-Diffusion-models-without-distortion

I wrote the permutation spec for Stable Diffusion that is needed to merge models with the git-re-basin method outlined here - https://github.com/samuela/git-re-basin. It is based on a 3rd-party PyTorch implementation of that method here - https://github.com/themrzmaster/git-re-basin-pytorch.
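
For readers new to the idea, here is a minimal sketch of the symmetry that a permutation spec describes: reordering the hidden units of one layer, together with the matching inputs of the next layer, leaves the network's output unchanged. This is a toy two-layer network in plain Python, not code from this repository; every name in it (`forward`, `permute_hidden`, the weights) is made up for illustration.

```python
# Illustrative toy example only; none of these names come from the repository.

def matvec(w, x):
    """Multiply matrix w (a list of rows) by vector x."""
    return [sum(wi * xi for wi, xi in zip(row, x)) for row in w]

def relu(v):
    return [max(0.0, a) for a in v]

def forward(w1, w2, x):
    """Compute y = W2 * relu(W1 * x)."""
    return matvec(w2, relu(matvec(w1, x)))

def permute_hidden(w1, w2, perm):
    """Reorder hidden units: rows of W1 and the matching columns of W2."""
    w1_p = [w1[i] for i in perm]
    w2_p = [[row[i] for i in perm] for row in w2]
    return w1_p, w2_p

w1 = [[1.0, -2.0], [0.5, 0.25], [-1.0, 3.0]]  # 3 hidden units, 2 inputs
w2 = [[2.0, -1.0, 0.5]]                       # 1 output, 3 hidden units
x = [1.0, 2.0]

w1_p, w2_p = permute_hidden(w1, w2, perm=[2, 0, 1])
y = forward(w1, w2, x)
y_p = forward(w1_p, w2_p, x)
print(y, y_p)  # the two outputs agree although the weights are reordered
```

Roughly speaking, a permutation spec lists which axes of which tensors have to be reordered together in this way, so that one model can be aligned to another before the weights are combined.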

To merge, you may need to install pytorch 1.11.0 or lower (at some point, 1.12.0 did not work but the latest versions of pytorch may have resolved the issue).

Download the code folder, open cmd in the directory, transfer the desired models to the same folder and run "python SD_rebasin_merge.py --model_a nameofmodela.ckpt --model_b nameofmodelb.ckpt"

If the models are not in the same directory, use their paths (pathofmodela.ckpt and pathofmodelb.ckpt) instead.

Notes for SDXL by DammK

Tested in A1111 WebUI 1.9.3 and sd-mecha
The SDXL code only permutes a few layers.
However, the full permutation spec is present. It turned out to be entirely different. I have posted a text file describing the tensor size of each layer. This implementation requires the tensor sizes to match.
No pruning, even though it may not crash: WebUI extensions / nodes will break.
The CLIP fix has not been revised. Use other tools instead.
SD1.5 / SD2.1 / SDXL will be detected automatically.
SD2.1 is only partially supported: the structure of both models must be identical. See this comparison for the difference (SD2.1 vs WD1.4). Use toolkit to inject the layers.
Then I'll try my best to analyze the effect. I will post to this article about the algorithm and my mega mix, which is 70+ in 1.
~~Bonus task (probably impossible): Implement Algorithm 3 MERGEMANY~~ Probably infeasible, even with sd-mecha's well-structured codebase. This implementation requires its own layer loading structure for iteration.
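
Since the notes above say that tensor sizes and model structure must match, a quick pre-check before starting a merge that takes hours can save time. The following is only a sketch, not part of this repository: `find_shape_mismatches` is a hypothetical helper, and the layer names and shapes are invented. For real checkpoints, the shape dictionaries could presumably be built with something like `{k: tuple(v.shape) for k, v in state_dict.items()}`.

```python
def find_shape_mismatches(shapes_a, shapes_b):
    """Return {key: (shape_a, shape_b)} for keys that are missing or differ in shape.

    A missing key is reported with None in place of its shape.
    """
    problems = {}
    for key in sorted(set(shapes_a) | set(shapes_b)):
        shape_a, shape_b = shapes_a.get(key), shapes_b.get(key)
        if shape_a != shape_b:
            problems[key] = (shape_a, shape_b)
    return problems

# Hypothetical layer names and shapes, for illustration only.
shapes_a = {"block.0.weight": (320, 4, 3, 3), "block.0.bias": (320,)}
shapes_b = {"block.0.weight": (320, 9, 3, 3), "block.0.bias": (320,)}

mismatches = find_shape_mismatches(shapes_a, shapes_b)
for key, (shape_a, shape_b) in mismatches.items():
    print(f"{key}: model A has {shape_a}, model B has {shape_b}")
```

An empty result means that, at least by key names and shapes, the two models line up.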

python SD_rebasin_merge.py --model_a _211-Replicant-V3.0_fp16.safetensors --model_b _220-realgarV20V21V21Yuri_v21.safetensors

SDXL will take hours to merge! 6 minutes per permutation! The default name of the merged model is merged.safetensors.

weight_matching in fp32:  33%|██████████████████▎                                    | 1/3 [12:07<24:15, 727.52s/it] 
Applying weighted_sum to special_keys: 100%|████████████████████████████████████████| 6/6 [00:00<00:00, 6009.03it/s] 
Main loop: 100%|████████████████████████████████████████████████████████████████| 10/10 [3:47:06<00:00, 1362.64s/it]

Saving...
Done!

The final result (you can actually derive this from the paper) is based on averaging, i.e. $(A*0.5+B*0.5)$. However, similar to TIES and AutoMBW, it looks better than plain averaging.
Both comparisons are "avg / TIES-SOUP / avg(avg+TIES-SOUP) / rebasin(avg+TIES-SOUP)"
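
As a point of reference, the plain averaging step $(A*0.5+B*0.5)$ can be sketched as below. This is a simplified illustration on flat Python lists, not the code in SD_rebasin_merge.py: `weighted_sum` and the sample "models" are hypothetical, and the sketch leaves out the re-basin part, where model B is permuted to align with model A before the weights are combined.

```python
def weighted_sum(model_a, model_b, alpha=0.5):
    """Return (1 - alpha) * A + alpha * B for every key.

    Both models must have the same keys and the same number of values per key.
    """
    if model_a.keys() != model_b.keys():
        raise ValueError("models must have identical keys")
    merged = {}
    for key in model_a:
        a, b = model_a[key], model_b[key]
        if len(a) != len(b):
            raise ValueError(f"size mismatch for {key}")
        merged[key] = [(1 - alpha) * x + alpha * y for x, y in zip(a, b)]
    return merged

# Hypothetical one-layer "models", for illustration only.
model_a = {"layer.weight": [1.0, 2.0, 3.0]}
model_b = {"layer.weight": [3.0, 2.0, -1.0]}

merged = weighted_sum(model_a, model_b)  # alpha=0.5 gives A*0.5 + B*0.5
print(merged)
```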

ogkalu2/Merge-Stable-Diffusion-models-without-distortion
