In this project, I have created a neural network architecture to automatically generate captions from images. After using the Microsoft Common Objects in COntext (MS COCO) dataset to train my network, I have tested my network on novel images!
How to run:
- Run 0_Dataset file
- Then 1,2,3 files
- model.py contains the architecture CNN-RNN model which you can modify if you want to experiment.
- Dataloader loads the data.