I implemented a full deep learning pipeline that generates captions for images using a CNN encoder & RNN decoder.
All code was written using Jupyter Notebook and was created for Udacity's "Computer Vision" course. The Notebooks are also fully documented with visuals, answers, and descriptions.
Explores the COCO Dataset.
Loads the COCO Dataset and performs pre-processing. Also design a CNN-RNN model for automatically generating image captions.
Select the hyperparameters and train the CNN-RNN model.
Utilize the CNN-RNN model to generate captions for test images.