- Need to configure the backend to be able to run on Heroku or on other service
- Create the frontend for this project
Mayank Singla
Generates textual description of any given image. Use both Natural Language Processing (NLP) and Computer Vision to generate captions. The idea implemented is to replace the encoder (RNN layer) in an encoder-decoder architecture with a deep Convolutional Neural Network (CNN) trained to classify objects in images.