simple diffusion transformer

implements Scalable Diffusion Models with Transformers (https://arxiv.org/pdf/2212.09748) in a simple, clean, and minimal way. uses the adaLN-Zero variant of the DiT transformer block. meant for practice, not for production use.
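
a minimal numpy sketch of the adaLN-Zero idea, for intuition only. it is not the code in transformer.py. the names `adaln_zero_block`, `w_mod`, `b_mod`, and the tanh `sublayer` stand-in are assumptions made for illustration, and the SiLU that the paper applies to the conditioning vector is left out. the conditioning vector produces a shift, a scale, and a gate. because the projection that produces them starts at zero, the gate is zero and the block starts out as the identity.

```python
import numpy as np

def layer_norm(x, eps=1e-6):
    # LayerNorm without learnable affine parameters; adaLN supplies shift and scale.
    mean = x.mean(axis=-1, keepdims=True)
    var = x.var(axis=-1, keepdims=True)
    return (x - mean) / np.sqrt(var + eps)

def modulate(x, shift, scale):
    # shift, scale: (batch, dim), broadcast over the token axis of x: (batch, tokens, dim).
    return x * (1.0 + scale[:, None, :]) + shift[:, None, :]

def adaln_zero_block(x, cond, w_mod, b_mod, sublayer):
    # cond: (batch, cond_dim); w_mod: (cond_dim, 3 * dim); b_mod: (3 * dim,)
    shift, scale, gate = np.split(cond @ w_mod + b_mod, 3, axis=-1)
    h = sublayer(modulate(layer_norm(x), shift, scale))
    # Gated residual: with gate == 0 the block returns x unchanged.
    return x + gate[:, None, :] * h

rng = np.random.default_rng(0)
batch, tokens, dim, cond_dim = 2, 4, 8, 8
x = rng.normal(size=(batch, tokens, dim))
cond = rng.normal(size=(batch, cond_dim))

# Hypothetical stand-in for the attention or MLP sublayer.
w_mix = rng.normal(size=(dim, dim))
sublayer = lambda h: np.tanh(h @ w_mix)

# Zero-initialized modulation projection: the "Zero" in adaLN-Zero.
w_zero = np.zeros((cond_dim, 3 * dim))
b_zero = np.zeros(3 * dim)
out_zero = adaln_zero_block(x, cond, w_zero, b_zero, sublayer)

# After training the projection is no longer zero; random weights mimic that here.
w_trained = rng.normal(scale=0.1, size=(cond_dim, 3 * dim))
out_trained = adaln_zero_block(x, cond, w_trained, b_zero, sublayer)
```

in the paper, each transformer block applies this pattern twice, once around self-attention and once around the MLP.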

overview

patchify: the image is split into patches
position embedding: a learnable position embedding is added to the patch tokens
transformer: patch tokens are passed through a transformer encoder
decoder: reconstructs the image from the transformer's output patch tokens
diffusion: noise is added to the image, and the model learns to denoise it at each step
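
the patchify, position embedding, and decoder steps above can be sketched in numpy as below. this is an illustration under assumptions, not the code in model.py: the function names, the 8x8 image size, the patch size of 4, and the hidden size of 16 are all made up for the example, and the transformer itself is skipped.

```python
import numpy as np

def patchify(images, patch_size):
    # images: (batch, channels, height, width)
    # returns: (batch, num_patches, patch_size * patch_size * channels)
    b, c, h, w = images.shape
    p = patch_size
    assert h % p == 0 and w % p == 0, "image size must be divisible by patch size"
    gh, gw = h // p, w // p
    x = images.reshape(b, c, gh, p, gw, p)
    x = x.transpose(0, 2, 4, 3, 5, 1)  # (b, gh, gw, p, p, c)
    return x.reshape(b, gh * gw, p * p * c)

def unpatchify(patches, patch_size, channels, height, width):
    # Inverse of patchify: rebuilds (batch, channels, height, width).
    b = patches.shape[0]
    p = patch_size
    gh, gw = height // p, width // p
    x = patches.reshape(b, gh, gw, p, p, channels)
    x = x.transpose(0, 5, 1, 3, 2, 4)  # (b, c, gh, p, gw, p)
    return x.reshape(b, channels, height, width)

rng = np.random.default_rng(0)
images = rng.normal(size=(2, 3, 8, 8))
patch_size, hidden = 4, 16

patches = patchify(images, patch_size)            # (2, 4, 48)
num_patches, patch_dim = patches.shape[1], patches.shape[2]

# Linear patch embedding plus a learnable position embedding
# (randomly initialized here; a real model would train both).
w_embed = rng.normal(scale=0.02, size=(patch_dim, hidden))
pos_embed = rng.normal(scale=0.02, size=(1, num_patches, hidden))
tokens = patches @ w_embed + pos_embed            # (2, 4, 16)

# Decoder stand-in: a linear map from hidden size back to patch pixels,
# followed by unpatchify to get an image-shaped output.
w_decode = rng.normal(scale=0.02, size=(hidden, patch_dim))
decoded = unpatchify(tokens @ w_decode, patch_size, 3, 8, 8)

# Round trip without any model in between recovers the input exactly.
restored = unpatchify(patches, patch_size, 3, 8, 8)
```

the round trip at the end is a quick way to check that the reshape and transpose orders in the two functions agree.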

files

train.py: contains training loop for the DiT model
model.py: implements DiT (Diffusion Transformer) model
transformer.py: defines TransformerBlock, SelfAttention, and LayerNorm
diffusion.py: defines diffusion process for the model
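
for orientation, here is a rough numpy sketch of the kind of forward diffusion process a file like diffusion.py typically defines. it assumes a DDPM-style linear beta schedule with 1000 steps and a noise-prediction loss; the actual schedule, step count, and function names in this repo may differ, and `linear_beta_schedule`, `q_sample`, and `noise_prediction_loss` are names chosen for the example.

```python
import numpy as np

def linear_beta_schedule(num_steps, beta_start=1e-4, beta_end=0.02):
    # Per-step noise variances, increasing linearly (assumed DDPM-style defaults).
    return np.linspace(beta_start, beta_end, num_steps)

def q_sample(x0, t, alpha_bar, noise):
    # Forward process: x_t = sqrt(alpha_bar_t) * x0 + sqrt(1 - alpha_bar_t) * noise
    # x0: (batch, ...); t: (batch,) integer timesteps.
    shape = (-1,) + (1,) * (x0.ndim - 1)
    signal = np.sqrt(alpha_bar[t]).reshape(shape)
    sigma = np.sqrt(1.0 - alpha_bar[t]).reshape(shape)
    return signal * x0 + sigma * noise

def noise_prediction_loss(predicted_noise, noise):
    # Mean squared error between the predicted and the true noise.
    return float(np.mean((predicted_noise - noise) ** 2))

num_steps = 1000
betas = linear_beta_schedule(num_steps)
alpha_bar = np.cumprod(1.0 - betas)  # cumulative fraction of signal kept

rng = np.random.default_rng(0)
x0 = rng.normal(size=(2, 3, 8, 8))
noise = rng.normal(size=x0.shape)

# First sample at the earliest step (almost clean), second at the last (almost pure noise).
t = np.array([0, num_steps - 1])
x_t = q_sample(x0, t, alpha_bar, noise)

# A perfect noise predictor would reach zero loss.
perfect_loss = noise_prediction_loss(noise, noise)
```

in a training loop, `t` would be drawn at random for each sample and the model would receive `x_t` and `t` and be trained to predict `noise`.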
