Podvodka: Training a Language Model to Write Prompts for Stable Diffusion

Step 1:

Train a base model on prompts from the Stable Diffusion Discord, paired with image descriptions extracted from those prompts using GPT-3. See behavior_cloning.

Use the following template: image description</s>

Collect preference annotations through pairwise comparisons of images generated from prompts written by the base model. See preference_collection.
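As a rough illustration of what pairwise annotations might look like once collected, the sketch below turns raw comparisons into (chosen, rejected) prompt pairs and computes a per-prompt win rate. The `Comparison` record, its field names, and both helper functions are hypothetical and are not taken from preference_collection; the actual storage format in this repository may differ.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class Comparison:
    """One annotation: two prompts whose images were shown side by side.

    Hypothetical record layout, assumed for illustration only.
    """
    prompt_a: str
    prompt_b: str
    preferred: str  # "a" or "b"


def to_preference_pairs(comparisons):
    """Convert annotations into (chosen, rejected) prompt pairs."""
    pairs = []
    for c in comparisons:
        if c.preferred == "a":
            pairs.append((c.prompt_a, c.prompt_b))
        elif c.preferred == "b":
            pairs.append((c.prompt_b, c.prompt_a))
        else:
            raise ValueError(f"unexpected label: {c.preferred!r}")
    return pairs


def win_rates(pairs):
    """Fraction of comparisons each prompt won, over those it appeared in."""
    wins = Counter(chosen for chosen, _ in pairs)
    seen = Counter()
    for chosen, rejected in pairs:
        seen[chosen] += 1
        seen[rejected] += 1
    return {prompt: wins[prompt] / seen[prompt] for prompt in seen}
```

The (chosen, rejected) pairs are the form a pairwise reward model typically consumes; the win rates are only a quick sanity check on the annotations.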

Train a reward model on the collected preferences. See reward_model.
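Reward models trained on pairwise preferences commonly minimize a Bradley-Terry style loss, -log(sigmoid(r_chosen - r_rejected)), which pushes the score of the preferred item above the score of the rejected one. The sketch below shows that loss in plain Python on already-computed scalar scores. It is an assumption that reward_model uses this objective; the function names are hypothetical and the code in this repository may differ.

```python
import math


def pairwise_loss(reward_chosen, reward_rejected):
    """Bradley-Terry loss: -log(sigmoid(reward_chosen - reward_rejected)).

    Written as softplus(-margin) and split by sign to avoid overflow.
    """
    margin = reward_chosen - reward_rejected
    if margin >= 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))


def mean_pairwise_loss(scored_pairs):
    """Average loss over (reward_chosen, reward_rejected) score pairs."""
    losses = [pairwise_loss(chosen, rejected) for chosen, rejected in scored_pairs]
    return sum(losses) / len(losses)
```

When both scores are equal the loss is log(2); it approaches zero as the preferred item's score pulls ahead and grows roughly linearly when the ranking is wrong.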

Fine-tuning was done with CarperAI's trlx. See rl.
