My Implementation of Adversarial Diffusion Distillation https://arxiv.org/pdf/2311.17042.pdf
This implementation differs from the original implementation in the paper in following ways:
- The discriminator is a single prediction head resnet50 initialized from these weights, but not a ViT conditioned on DINOv2 and CLIP features. Therefore has only 1 prediction head, instead of k.
- The discriminator is trained without the R1 penatly