Support for Token Lengths Exceeding 75 Tokens in Text Encoder #450

celll1 · 2024-09-03T10:56:26Z

Description:
This pull request modifies the Text Encoder to support inputs longer than 75 tokens, up to 3 chunks of 75 tokens each. The implementation has been confirmed to work with SDXL LoRA.

Key Changes:

BOS/EOS tokens are now inserted every 75 tokens.
An attention mask is applied to the intermediate special tokens.
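
For readers who want a concrete picture of the two changes above, here is a minimal sketch in plain Python. It is not the code from this PR: the function name `chunk_with_special_tokens`, the token ids (`BOS_ID`, `EOS_ID`, `PAD_ID`), and the choice to also mask padding are assumptions made for illustration, and it shows only one possible reading of "intermediate special tokens" (every BOS except the first and every EOS except the last).

```python
from typing import List, Tuple

# Assumed values for illustration; check them against the tokenizer actually in use.
BOS_ID = 49406   # assumed CLIP start-of-text id
EOS_ID = 49407   # assumed CLIP end-of-text id
PAD_ID = EOS_ID  # assumption: pad with the EOS id
CHUNK_SIZE = 75  # content tokens per chunk
MAX_CHUNKS = 3   # the PR description states support for up to 3 chunks


def chunk_with_special_tokens(token_ids: List[int]) -> Tuple[List[int], List[int]]:
    """Wrap every 75 content tokens in BOS/EOS and build a matching attention mask.

    Hypothetical helper: the mask is 0 on intermediate BOS/EOS tokens and on
    padding, and 1 everywhere else.
    """
    token_ids = token_ids[: CHUNK_SIZE * MAX_CHUNKS]     # truncate overly long input
    n_chunks = max(1, -(-len(token_ids) // CHUNK_SIZE))  # ceiling division, at least 1

    ids: List[int] = []
    mask: List[int] = []
    for i in range(n_chunks):
        content = token_ids[i * CHUNK_SIZE : (i + 1) * CHUNK_SIZE]
        pad = CHUNK_SIZE - len(content)
        is_first = i == 0
        is_last = i == n_chunks - 1

        ids += [BOS_ID] + content + [EOS_ID] + [PAD_ID] * pad
        mask += (
            [1 if is_first else 0]   # only the very first BOS is attended to
            + [1] * len(content)     # content tokens are always attended to
            + [1 if is_last else 0]  # only the very last EOS is attended to
            + [0] * pad              # padding is masked out
        )
    return ids, mask


if __name__ == "__main__":
    ids, mask = chunk_with_special_tokens(list(range(100)))
    print(len(ids), sum(mask))
```

With 100 content tokens this produces two chunks of 77 positions each. At the maximum of 3 chunks the sequence is 3 × 77 = 231 positions long, which appears to be the figure the first commit message refers to.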

Notes:

Please note that the handling of BOS/EOS tokens differs from the current implementation in sd-scripts.
I am uncertain whether applying the attention mask in this manner is the correct approach, and I welcome any feedback or suggestions for improvement.

Additional Private Modifications:
This PR also includes the following personal changes:

Support for Adam-mini optimizer.
Modifications to tqdm for use in Jupyter notebooks.
Support for Hugging Face Accelerate (distributed training has not yet been tested).
Please note that due to these changes, the diffusers package needs to be updated to version 0.30.0 or later to enable resumption from a backup, as the model is sharded during backup.
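
To check the version requirement before resuming from a backup, something like the following could be used. It is a sketch, not part of this PR: the helper names `parse_release` and `diffusers_is_recent_enough` are hypothetical, and the parsing keeps only the leading numeric part of the version string.

```python
import re
from importlib.metadata import PackageNotFoundError, version
from typing import Tuple

MIN_DIFFUSERS = (0, 30, 0)  # minimum version stated in the PR description


def parse_release(version_string: str) -> Tuple[int, ...]:
    """Return the leading numeric release as a 3-tuple, e.g. '0.30.0.dev0' -> (0, 30, 0)."""
    match = re.match(r"\d+(?:\.\d+)*", version_string)
    if match is None:
        raise ValueError(f"unrecognized version string: {version_string!r}")
    parts = [int(p) for p in match.group().split(".")]
    return tuple((parts + [0, 0, 0])[:3])  # pad short versions such as '0.30'


def diffusers_is_recent_enough() -> bool:
    """Hypothetical helper: True if the installed diffusers release is >= 0.30.0."""
    try:
        return parse_release(version("diffusers")) >= MIN_DIFFUSERS
    except PackageNotFoundError:
        return False  # diffusers is not installed at all


if __name__ == "__main__":
    if not diffusers_is_recent_enough():
        print("diffusers >= 0.30.0 is required to resume from a backup")
```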

I hope you find the necessary parts of this PR useful.

celll1 added 15 commits August 27, 2024 17:14

Implement of accelerate and long token (under 231).

5b0a9d8

implementation of attention mask

4c70528

Adam-mini

089b8f3

Fix: Adam-mini

29dd300

Fix: Adam-mini 2

43f72c1

Fix: Adam-mini 3

87727c1

Merge remote-tracking branch 'upstream/master' into dev

14014cc

Jupyter notebook

6c224b7

Remove log file.

c43c05e

Translate to English.

c41077e

Unlock Flux Finetune.

4b18e79

Tokenizer code is moved to clip_util.py

0d72d61

Fix: clip_util.py.

e157d75

Merge remote-tracking branch 'upstream/master' into dev

40fd26b

fix: attention mask device.

795b383

celll1 force-pushed the dev branch from 37baa59 to 795b383 on September 8, 2024 10:05

celll1 added 4 commits September 9, 2024 17:42

Fix: Accelerate launch.

72c0e12

Fix: Accelerate launch 2.

db9f243

Fix: Accelerate launch 3.

6c2c829

test

ca57444
