flickr8k
There are 9 repositories under the flickr8k topic.
kakshak07/Image-Captioining
The objective is to generate a textual description of an image based on the objects and actions it contains, using generative models so that the output consists of novel sentences. Pipeline-type models use two separate learning processes, one for language modelling and one for image recognition. The system first identifies objects in the image, passes the result through the Inception-v3 model to obtain an embedding vector, and then feeds that vector into a series of LSTM cells to produce the desired captions.
awsaf49/flickr-dataset
Download the Flickr8k and Flickr30k image caption datasets
Subangkar/Image-Captioning-Attention-PyTorch
An attention-based sequential deep learning model, implemented in PyTorch, that generates a single-line caption for an input image
KimRass/CLIP
PyTorch implementation of 'CLIP' (Radford et al., 2021) from scratch and training it on Flickr8k + Flickr30k
spokenlanguage/platalea
Library for training visually-grounded models of spoken language understanding.
tojiboyevf/image_captioning
Deep Learning Final project 2022
varshithhowdekar03/Image-Caption-Generator-using-Deep-Learning-CNN-and-LSTM-
Image captioning is a task in which each image must be understood properly so that a suitable caption with correct grammatical structure can be generated. This is a hybrid system that uses a multilayer CNN (Convolutional Neural Network) to generate keywords.
GuyKabiri/Image-Caption
An exercise on captioning images from the Neural Networks for Computer Vision course, using the Flickr8K dataset and a simple encoder-decoder architecture. Evaluation is based on cross-entropy loss and the 4-gram BLEU score.
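For readers unfamiliar with the 4-gram BLEU metric mentioned here, the following is a minimal sketch of a sentence-level score, not code from this repository. It assumes pre-tokenized captions, uniform weights over 1- to 4-grams, and no smoothing; the names `ngrams` and `bleu4` are illustrative.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Return a Counter of all n-grams (as tuples) in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def bleu4(candidate, references):
    """Sentence-level BLEU, uniform weights over 1- to 4-grams, no smoothing."""
    log_precisions = []
    for n in range(1, 5):
        cand_counts = ngrams(candidate, n)
        total = sum(cand_counts.values())
        # Clip each candidate n-gram count by its maximum count in any reference.
        max_ref = Counter()
        for ref in references:
            for gram, count in ngrams(ref, n).items():
                max_ref[gram] = max(max_ref[gram], count)
        clipped = sum(min(count, max_ref[gram]) for gram, count in cand_counts.items())
        if total == 0 or clipped == 0:
            return 0.0  # without smoothing, any zero precision gives a score of 0
        log_precisions.append(math.log(clipped / total))
    # Brevity penalty uses the reference length closest to the candidate length.
    c = len(candidate)
    r = min((abs(len(ref) - c), len(ref)) for ref in references)[1]
    bp = 1.0 if c > r else math.exp(1 - r / c)
    return bp * math.exp(sum(log_precisions) / 4)


# Example: one generated caption scored against one reference caption.
candidate = "a dog runs on the grass".split()
references = ["a dog runs on the green grass".split()]
score = bleu4(candidate, references)
```

Flickr8k provides several reference captions per image, so in practice `references` would typically hold all of them. Reported results are usually corpus-level and often smoothed, so values from this sketch are not directly comparable with published numbers.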
roysti10/Image_Captioning
Image captioning using an encoder-decoder network, with pretrained models provided