Skip to content

Exploring image caption genration using modern NLP models in the context of IA369Z - Reproducibility in Computational Research taken in first semester of 2020 at UNICAMP.

License

Notifications You must be signed in to change notification settings

LuizPitaAlmeida/image_caption_generator

Repository files navigation

Image Caption Generator

This repository contains an executable paper that runs a simple experiment in image captioning. To do this it make use of computer vision and natural language process techniques such as CNN and Transformer models.

More info about algorithm refer to Image_Captioning.pdf.

For a reproducible paper of our results refer to Colab Notebook

Now, the reproduction only by Colab with GPU. The Docker feature described here are under construction.

Repository structure

Folders

  • data: Links for data download.
  • executable_papers: Notebooks with image captioning examples
  • figures: Figures used in the paper
  • src: Python code with scripts for image captioning

Files

Setup Environment

These installation instructions were designed for Linux (Ubuntu) operational systems. For other systems please refer to Docker Installation Documentation

Install Docker [Under Construction]

For an easy installation, in a Linux terminal run:

$ sudo apt-get update
$ sudo apt-get install --yes docker.io

Test Docker installation:

$ sudo docker run --rm hello-world

Although it is easy, Docker Inc. recommends a different installation way. For more details refer to Docker Installation Documentation

Build Environment [Under Construction]

From the root directory of repository, run:

$ sudo docker build --tag img_caption --file Dockerfile --pull .

Run Executable Papers [Under Construction]

From the root directory of repository, run:

$ sudo docker run -it --rm -p 8888:8888 \
    --volume "$PWD"/executable_papers:/executable_papers \
    img_caption

After Control+Click in the prompt link, the notebook directory will open in your browser. Chose one of the options and run all cells.

About

Exploring image caption genration using modern NLP models in the context of IA369Z - Reproducibility in Computational Research taken in first semester of 2020 at UNICAMP.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published