A pretrained model is available to load into the decoder.
BLEU scores and top-k accuracies for VGG19 (orange) and ResNet152 (red), trained with teacher forcing:

| BLEU Score | Graph | Top-K Accuracy | Graph |
| --- | --- | --- | --- |
| BLEU-1 | (plot) | Training Top-1 | (plot) |
| BLEU-2 | (plot) | Training Top-5 | (plot) |
| BLEU-3 | (plot) | Validation Top-1 | (plot) |
| BLEU-4 | (plot) | Validation Top-5 | (plot) |
This was written in Python 3, so it may not work with Python 2. Download the COCO dataset training and validation images and put them in `data/coco/imgs/train2014` and `data/coco/imgs/val2014` respectively. Put the COCO dataset split JSON file from Deep Visual-Semantic Alignments in `data/coco/`; it should be named `dataset.json`.
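As a quick sanity check before preprocessing, the expected layout can be verified with a short script like the one below. The paths mirror the instructions above; the script itself is not part of the repository.

```python
import os

# Expected data layout, as described above.
expected = [
    'data/coco/imgs/train2014',
    'data/coco/imgs/val2014',
    'data/coco/dataset.json',
]

for path in expected:
    status = 'ok' if os.path.exists(path) else 'MISSING'
    print(f'{status:>7}  {path}')
```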
Run the preprocessing to create the needed JSON files:

`python generate_json_data.py`
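For context, `dataset.json` (the Deep Visual-Semantic Alignments split file) stores each image with a `split` field and its tokenized captions. The sketch below shows roughly how such a file can be broken into per-split JSON files; it only illustrates the data format and is not the actual contents of `generate_json_data.py`.

```python
import json
from collections import defaultdict

# Load the split file described above.
with open('data/coco/dataset.json') as f:
    dataset = json.load(f)

# Group images (and their tokenized captions) by split: 'train', 'val', 'test', ...
splits = defaultdict(list)
for img in dataset['images']:
    captions = [' '.join(sent['tokens']) for sent in img['sentences']]
    splits[img['split']].append({'filename': img['filename'], 'captions': captions})

# Write one JSON file per split (hypothetical output names).
for split, entries in splits.items():
    with open(f'data/coco/{split}.json', 'w') as f:
        json.dump(entries, f)
```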
Start the training by running:

`python train.py`

The models will be saved in `model/` and the training statistics will be saved in `runs/`. To see the training statistics, use:

`tensorboard --logdir runs`
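The statistics in `runs/` are TensorBoard event files. A training loop typically writes them with a summary writer along the lines of the sketch below, shown with `torch.utils.tensorboard`; the repository may use a different writer, and the tag names here are made up.

```python
from torch.utils.tensorboard import SummaryWriter

# Writing to runs/ is what makes `tensorboard --logdir runs` work.
writer = SummaryWriter(log_dir='runs')

for epoch in range(10):
    train_loss = 0.0  # placeholder; compute the real loss in the training loop
    val_bleu4 = 0.0   # placeholder; compute BLEU-4 on the validation set
    writer.add_scalar('loss/train', train_loss, epoch)
    writer.add_scalar('bleu4/val', val_bleu4, epoch)

writer.close()
```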
To generate a caption for an image with a trained model, run:

`python generate_caption.py --img-path <PATH_TO_IMG> --model <PATH_TO_MODEL_PARAMETERS>`
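`<PATH_TO_MODEL_PARAMETERS>` points at saved PyTorch parameters (either a model you trained yourself or the pretrained model mentioned above). If you need to load such a checkpoint in your own code, the usual device-aware pattern is sketched below; the checkpoint path and decoder class are placeholders, not names taken from this repository.

```python
import torch

# Use the GPU only when available, otherwise fall back to the CPU
# (the same behaviour described in the checklist below).
device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')

# map_location lets a checkpoint trained on a GPU be loaded on a CPU-only machine.
# 'model/decoder.pth' is a hypothetical path.
state_dict = torch.load('model/decoder.pth', map_location=device)

# decoder = Decoder(...)              # the repository's decoder class and arguments
# decoder.load_state_dict(state_dict)
# decoder.to(device)
# decoder.eval()
```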
Implementation checklist:

- Create image encoder class
- Create decoder class
- Create dataset loader
- Write main function for training and validation
- Implement attention model (see the attention sketch after this list)
- Implement decoder feed forward function
- Write training function
- Write validation function
- Add BLEU evaluation
- Update code to use GPU only when available, otherwise use CPU
- Add performance statistics
- Allow encoder to use resnet-152 and densenet-161
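The attention model in the checklist is the soft (additive) attention from the Bahdanau et al. paper referenced below: the decoder's hidden state is compared against every spatial location of the encoder features to produce weights, and the weighted sum of features becomes the context vector fed to the decoder. A minimal PyTorch sketch of that idea follows; the dimensions and names are illustrative, not the repository's actual implementation.

```python
import torch
import torch.nn as nn

class SoftAttention(nn.Module):
    """Additive (Bahdanau-style) attention over spatial image features."""

    def __init__(self, feature_dim, hidden_dim, attn_dim):
        super().__init__()
        self.feature_proj = nn.Linear(feature_dim, attn_dim)  # project encoder features
        self.hidden_proj = nn.Linear(hidden_dim, attn_dim)    # project decoder hidden state
        self.score = nn.Linear(attn_dim, 1)                   # scalar score per location

    def forward(self, features, hidden):
        # features: (batch, num_locations, feature_dim) from the CNN encoder
        # hidden:   (batch, hidden_dim) from the RNN decoder
        scores = self.score(torch.tanh(
            self.feature_proj(features) + self.hidden_proj(hidden).unsqueeze(1)
        )).squeeze(-1)                                        # (batch, num_locations)
        alpha = torch.softmax(scores, dim=1)                  # attention weights sum to 1
        context = (features * alpha.unsqueeze(-1)).sum(dim=1) # (batch, feature_dim)
        return context, alpha

# Example: 196 spatial locations (14x14) of 512-d VGG19 features, 512-d hidden state.
attn = SoftAttention(feature_dim=512, hidden_dim=512, attn_dim=256)
context, alpha = attn(torch.randn(4, 196, 512), torch.randn(4, 512))
```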
References:

- Original Theano Implementation
- Neural Machine Translation by Jointly Learning to Align and Translate (Bahdanau et al., 2015)