A neural network architecture that automatically generates captions from images. The network is trained on the Microsoft Common Objects in Context (MS COCO) dataset using a GPU. It consists of a CNN encoder and an LSTM decoder.
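
The encoder-decoder layout described above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: it assumes PyTorch, uses a tiny two-layer convolutional stack as a stand-in for whatever CNN the project uses, and all class names (`EncoderCNN`, `DecoderLSTM`), layer sizes, and the vocabulary size are hypothetical. Random tensors stand in for MS COCO images and tokenized captions.

```python
import torch
import torch.nn as nn


class EncoderCNN(nn.Module):
    """Hypothetical stand-in CNN: maps each image to one feature vector."""

    def __init__(self, embed_size):
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv2d(3, 16, kernel_size=3, stride=2, padding=1),
            nn.ReLU(),
            nn.Conv2d(16, 32, kernel_size=3, stride=2, padding=1),
            nn.ReLU(),
            nn.AdaptiveAvgPool2d(1),  # global average pool -> (B, 32, 1, 1)
        )
        self.fc = nn.Linear(32, embed_size)

    def forward(self, images):
        x = self.conv(images).flatten(1)  # (B, 32)
        return self.fc(x)                 # (B, embed_size)


class DecoderLSTM(nn.Module):
    """Hypothetical LSTM decoder: predicts caption tokens from image features."""

    def __init__(self, embed_size, hidden_size, vocab_size):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, embed_size)
        self.lstm = nn.LSTM(embed_size, hidden_size, batch_first=True)
        self.fc = nn.Linear(hidden_size, vocab_size)

    def forward(self, features, captions):
        # Teacher forcing: the image feature is the first input step,
        # followed by every caption token except the last one.
        tokens = self.embed(captions[:, :-1])                       # (B, T-1, E)
        inputs = torch.cat([features.unsqueeze(1), tokens], dim=1)  # (B, T, E)
        hidden, _ = self.lstm(inputs)
        return self.fc(hidden)                                      # (B, T, vocab)

    @torch.no_grad()
    def sample(self, features, max_len=20):
        # Greedy decoding: feed the most likely token back in at each step.
        inputs, state, out = features.unsqueeze(1), None, []
        for _ in range(max_len):
            hidden, state = self.lstm(inputs, state)
            token = self.fc(hidden.squeeze(1)).argmax(dim=1)        # (B,)
            out.append(token)
            inputs = self.embed(token).unsqueeze(1)                 # (B, 1, E)
        return torch.stack(out, dim=1)                              # (B, max_len)


torch.manual_seed(0)
vocab_size = 1000  # assumed vocabulary size, for illustration only
encoder = EncoderCNN(embed_size=64)
decoder = DecoderLSTM(embed_size=64, hidden_size=128, vocab_size=vocab_size)

images = torch.randn(2, 3, 64, 64)                # fake image batch
captions = torch.randint(0, vocab_size, (2, 12))  # fake token ids

logits = decoder(encoder(images), captions)
loss = nn.functional.cross_entropy(
    logits.reshape(-1, vocab_size), captions.reshape(-1)
)
sampled = decoder.sample(encoder(images), max_len=5)
```

In this sketch the image feature vector is fed to the LSTM as the first input step, so the output at step t is scored against caption token t. Moving the modules and tensors to a GPU would use the usual `.to("cuda")` calls when a device is available.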