Textual inversion support #5

vexCoder · 2023-02-09T10:20:44Z

Hi,

I might be talking nonsense here, as I have very little idea about the code. I just want to create a desktop frontend (TypeScript) using this as the backend (I'm still learning Python).

Anyhow, I wanted to know whether the script supports textual inversion (embeddings) like automatic1111 does. Or is there API documentation I can start reading so I can try to implement it myself?

Amblyopius · 2023-02-09T16:35:35Z

Maintainer

Textual inversion is not supported, as we currently have no training-related code in place. It is also quite far down the list of things I'd add to the ONNX-related code in the repository. That doesn't mean it's impossible, as many other models can be trained on ONNX, but it is currently beyond the scope of what's going on here.

If you want to work with training, I would suggest that, for the time being, you switch to Linux if your card isn't supported on Windows. With Linux+ROCm you should be able to use AMD cards with automatic1111.

vexCoder · 2023-02-09T16:51:33Z

Author

Ah, I didn't mean training. I meant using existing .pt files, for example from civit.ai.

Amblyopius · 2023-02-09T17:13:09Z

Maintainer

So you solely want the output of textual inversion to use it with the associated model?

That could be on the list at an earlier stage. As a shortcut you could probably just run the Text Encoder in torch on CPU and then run the generation in ONNX. My understanding is that it only impacts text embeddings.
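To make the "only impacts text embeddings" point concrete, here is a toy sketch in plain Python. It is not code from this repository or from any real library: `ToyEmbeddingTable`, `add_learned_token` and the `<my-style>` token are all hypothetical names, and the vectors are placeholders. It only illustrates the idea that, at inference time, a textual inversion embedding amounts to one extra token whose vector is added to the text encoder's lookup table, while everything downstream stays unchanged.

```python
# Toy illustration only: plain lists stand in for the text encoder's
# token embedding table. All names here are hypothetical.

class ToyEmbeddingTable:
    def __init__(self, vocab, dim):
        self.dim = dim
        self.token_to_id = {tok: i for i, tok in enumerate(vocab)}
        # Placeholder vectors standing in for trained weights.
        self.rows = [[float(i)] * dim for i in range(len(vocab))]

    def add_learned_token(self, token, vector):
        """Append one learned vector as a new row and return its token id."""
        if token in self.token_to_id:
            raise ValueError(f"token {token!r} already exists")
        if len(vector) != self.dim:
            raise ValueError("vector size must match the embedding dimension")
        self.token_to_id[token] = len(self.rows)
        self.rows.append(list(vector))
        return self.token_to_id[token]

    def embed(self, prompt):
        """Look up one vector per whitespace-separated token."""
        return [self.rows[self.token_to_id[tok]] for tok in prompt.split()]


table = ToyEmbeddingTable(["a", "photo", "of", "cat"], dim=3)
new_id = table.add_learned_token("<my-style>", [0.1, 0.2, 0.3])
embedded = table.embed("a photo of <my-style> cat")
```

In this sketch the existing rows are never modified, which is why the rest of the generation (UNET, VAE) would not need to know the extra token exists.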

If you want a pointer to what you need to understand conceptually about Stable Diffusion, I would suggest you read https://towardsdatascience.com/stable-diffusion-using-hugging-face-501d8dbdd8, as it explains the Text Encoder, UNET, VAE and the core related concepts. If you look at fig 15, there's a dark blue box called "CLIP model" that produces the textual embeddings. CLIP would run at an acceptable speed on CPU in torch.

To mix ONNX and torch you need to "hack" the ONNX pipeline. That is already demonstrated in code in the repo, as that's how we get the lpw pipeline working.
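A rough sketch of the torch-on-CPU half of that shortcut, using the Hugging Face `transformers` CLIP classes. This is not the repository's code. It assumes a diffusers-format model folder with `tokenizer` and `text_encoder` subfolders, and it assumes the learned embedding is a single vector that has already been extracted from the .pt file (the file layout varies between tools, so parsing it is left out). The function name and parameters are made up for illustration.

```python
def encode_prompt_with_embedding(prompt, model_dir, token, vector):
    """Return CLIP hidden states for `prompt` as a NumPy array.

    model_dir: local diffusers-format model folder (assumed layout).
    token:     placeholder string used in the prompt, e.g. "<my-style>".
    vector:    1-D torch tensor with the learned embedding (assumed to be a
               single vector matching the encoder's hidden size).
    """
    # Imported lazily so this module loads even without torch installed.
    import torch
    from transformers import CLIPTextModel, CLIPTokenizer

    tokenizer = CLIPTokenizer.from_pretrained(model_dir, subfolder="tokenizer")
    text_encoder = CLIPTextModel.from_pretrained(model_dir, subfolder="text_encoder")

    # Register the placeholder token and grow the embedding table by one row.
    if tokenizer.add_tokens(token) != 1:
        raise ValueError(f"token {token!r} already exists in the tokenizer")
    text_encoder.resize_token_embeddings(len(tokenizer))
    token_id = tokenizer.convert_tokens_to_ids(token)

    with torch.no_grad():
        # Overwrite the new row with the learned vector.
        text_encoder.get_input_embeddings().weight[token_id] = vector
        ids = tokenizer(
            prompt,
            padding="max_length",
            max_length=tokenizer.model_max_length,
            truncation=True,
            return_tensors="pt",
        ).input_ids
        # First output is the last hidden state: (1, sequence_length, hidden_size).
        hidden_states = text_encoder(ids)[0]

    return hidden_states.numpy()
```

The returned array would then take the place of the ONNX text encoder's output inside the modified pipeline. With classifier-free guidance, the empty or negative prompt has to be encoded the same way. How the array is handed to the pipeline depends on the pipeline code, so that part is left out here.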

1 reply

vexCoder Feb 10, 2023
Author

Thank you, mate. Your explanation is clear as day, even for someone like me who knows nothing about it.
