geniuskch/show_and_tell.tensorflow

Python

Neural Caption Generator

Implementation of "Show and Tell" http://arxiv.org/abs/1411.4555
Borrowed some code and ideas from Andrej Karpathy's NeuralTalk.
You need flickr30k data (images and annotations)

Code

make_flickr_dataset.py : Extracting feats of flickr30k images, and save them in './data/feats.npy'
model_tensorflow.py : TensorFlow Version
model_theano.py : Theano Version

Usage

Flickr30k Dataset Download
Extract VGG Featues of Flicker30k images (make_flickr_dataset.py)
Train: run train() in model_tensorflow.py or model_theano.py
Test: run test() in model_tensorflow.py or model_theano.py.
parameters: VGG FC7 feature of test image, trained model path