This project trains a CNN-RNN model to automatically generate captions for a given image. The main task is to implement an effective RNN decoder for a CNN encoder.

See requirements.txt for the necessary packages (e.g. `pip install -r requirements.txt`).

Important: PyTorch version 0.4.0 is required.
The Microsoft Common Objects in COntext (MS COCO) dataset is used to train the neural network. The final model is then tested on novel images!
The project is structured as a series of Jupyter notebooks that are designed to be completed in sequential order:
- 0_Dataset.ipynb
- 1_Preliminaries.ipynb
- 2_Training.ipynb
- 3_Inference.ipynb

In addition, model.py defines the network architecture.
The architecture consists of two parts:
- a CNN encoder, which converts an input image into an embedded feature vector, and
- an RNN decoder, a sequential network built from LSTM units, which translates the feature vector into a sequence of tokens (a sketch of both modules follows this list).
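For concreteness, here is a minimal sketch of what such an encoder/decoder pair can look like in PyTorch. It illustrates the structure described above, not the exact contents of model.py: the class names, the choice of ResNet-50 as the backbone, and the hyperparameter names are assumptions for illustration.

```python
import torch
import torch.nn as nn
import torchvision.models as models


class EncoderCNN(nn.Module):
    """Encode an image into a fixed-size feature vector with a pretrained CNN.
    (Illustrative sketch; ResNet-50 is an assumed backbone choice.)"""

    def __init__(self, embed_size):
        super(EncoderCNN, self).__init__()
        resnet = models.resnet50(pretrained=True)
        for param in resnet.parameters():
            param.requires_grad = False          # freeze the convolutional base
        modules = list(resnet.children())[:-1]   # drop the final classification layer
        self.resnet = nn.Sequential(*modules)
        self.embed = nn.Linear(resnet.fc.in_features, embed_size)

    def forward(self, images):
        features = self.resnet(images)
        features = features.view(features.size(0), -1)
        return self.embed(features)              # (batch, embed_size)


class DecoderRNN(nn.Module):
    """Decode a feature vector into a token sequence with an LSTM."""

    def __init__(self, embed_size, hidden_size, vocab_size, num_layers=1):
        super(DecoderRNN, self).__init__()
        self.embed = nn.Embedding(vocab_size, embed_size)
        self.lstm = nn.LSTM(embed_size, hidden_size, num_layers, batch_first=True)
        self.fc = nn.Linear(hidden_size, vocab_size)

    def forward(self, features, captions):
        # Teacher forcing: the image feature acts as the first "word",
        # followed by the embedded ground-truth caption (minus its last token).
        embeddings = self.embed(captions[:, :-1])
        inputs = torch.cat((features.unsqueeze(1), embeddings), dim=1)
        hiddens, _ = self.lstm(inputs)
        return self.fc(hiddens)                  # (batch, seq_len, vocab_size)
```

During training, the decoder's per-step vocabulary scores are compared against the ground-truth caption with a cross-entropy loss, which is the standard objective for this kind of captioning model.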
Shown below are some captions generated by the network for test images from the COCO dataset. At inference time the decoder has no ground-truth caption to condition on, so captions are generated greedily, one token at a time (sketched next).
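A minimal sketch of that greedy decoding loop, written as a `sample` method for the hypothetical `DecoderRNN` above; the `<end>` token index and the maximum caption length are assumptions, not values taken from this project:

```python
def sample(self, inputs, states=None, max_len=20, end_idx=1):
    """Greedily decode a caption from an image feature.

    `inputs` has shape (1, 1, embed_size): the encoder output,
    unsqueezed to form a length-1 sequence for the LSTM.
    `end_idx` is the assumed index of the <end> token in the vocabulary.
    """
    caption = []
    for _ in range(max_len):
        hiddens, states = self.lstm(inputs, states)   # one decoding step
        scores = self.fc(hiddens.squeeze(1))          # (1, vocab_size)
        _, predicted = scores.max(1)                  # most likely next token
        caption.append(predicted.item())
        if predicted.item() == end_idx:               # stop at <end>
            break
        inputs = self.embed(predicted).unsqueeze(1)   # feed prediction back in
    return caption
```

The returned list of token indices is then mapped back to words through the vocabulary to produce the captions shown here.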