Baby GPT

A simple GPT implementation (layered transformer decoder) written in PyTorch. It takes the text in the input.txt file and writes more like it to output.txt.

Usage

Put input text into input.txt, e.g.

wget https://raw.githubusercontent.com/karpathy/char-rnn/master/data/tinyshakespeare/input.txt

Then, run python gpt.py. The hyperparameters at the beginning of gpt.py were set for GPU training, and must be adjusted down if using a CPU to train.

Thanks to Andrej Karpathy for a great video on transformer implementations!

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
.vscode		.vscode
.gitignore		.gitignore
README.md		README.md
bigram_model.py		bigram_model.py
gpt.py		gpt.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Baby GPT

Usage

About

Releases

Packages

Languages

dcourv/baby_gpt

Folders and files

Latest commit

History

Repository files navigation

Baby GPT

Usage

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages