mxagar/image_captioning

Image captioning project with a deep learning model. Given an image, it is processed by an encoder CNN and then passed to a decoder RNN. The decoder outputs a text caption of the image content. The work is based on the paper "Show and Tell" by Vinyals et al. Pytorch is used for the implementation and the COCO dataset for training.

Jupyter Notebook

No issues in this repository yet.