/Image-Caption

#2 Project in the udacity's CompVision nanodegree

Primary LanguageJupyter Notebook

Image caption

This project #2 at the Udacity Computer Vision nanodegree: generation of a picture description using an encoder-decoder deep learning architecture.

Brief description:

Installation:

The following steps for default using pretrained model:

$ git clone git clone https://github.com/alex-f1tor/Image-Caption.git
$ cd Image-Caption
$ mkdir models
$ cd models
$  wget https://drive.google.com/open?id=19mcr08t6gY0UcUiAKTkBPO8MP0_wsghV -O 'decoder-4.pkl' && wget https://drive.google.com/open?id=1xe4zTMQAnH8QxcwHF7-i2lnmoBecJPYT -O 'encoder-4.pkl'

Generating descriptions for images

You can find an example of using this caption generator at Inference.ipynb notebook.

Few examples of generated captions for images:

Image

Image

You can also:

Model performance

The general estimation of captions quality generated for MS-COCO validation set by CIDEr metric: Image