/Image-Captioning-using-Attention-Mechanism

It uses a encoder-decoder architecture to caption Images. I used a pretrained ResNet model as the encoder to extract image features, combined with an LSTM decoder implementing the Bahdanau attention mechanism to generate captions using the extracted features.

Primary LanguageJupyter NotebookMIT LicenseMIT

This repository is not active