adityagh006/Image-Captioning-using-Attention-Mechanism

It uses a encoder-decoder architecture to caption Images. I used a pretrained ResNet model as the encoder to extract image features, combined with an LSTM decoder implementing the Bahdanau attention mechanism to generate captions using the extracted features.

Jupyter NotebookMIT

This repository is not active