Skip to content

Commit

Permalink
Added info about datset
Browse files Browse the repository at this point in the history
  • Loading branch information
jaretburkett committed Aug 10, 2024
1 parent 2308ef2 commit fa02e77
Showing 1 changed file with 13 additions and 0 deletions.
13 changes: 13 additions & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -79,6 +79,19 @@ Coming very soon. Getting base out then will have a notebook that makes all that

---

## Dataset Preparation

Datasets generally need to be a folder containing images and associated text files. Currently, the only supported
formats are jpg, jpeg, and png. Webp currently has issues. The text files should be named the same as the images
but with a `.txt` extension. For example `image22.jpg` and `image2.txt`. The text file should contain only the caption.
You can add the word `[trigger]` in the caption file and if you have `trigger_word` in your config, it will be automatically
replaced.

Images are never upscaled but they are downscaled and placed in buckets for batching. **You do not need to crop/resize your images**.
The loader will automatically resize them and can handle varying aspect ratios.

---

## EVERYTHING BELOW THIS LINE IS OUTDATED

It may still work like that, but I have not tested it in a while.
Expand Down

0 comments on commit fa02e77

Please sign in to comment.