caption-generation

There are 97 repositories under the caption-generation topic.

aimagelab/meshed-memory-transformer
Meshed-Memory Transformer for Image Captioning. CVPR 2020
Language: Python 544 11 97135
dabasajay/Image-Caption-Generator
A neural network to generate captions for an image using CNN and RNN with BEAM Search.
Language: Python 307 6 1982
aimagelab/show-control-and-tell
Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions. CVPR 2019
Language: Python 284 9 3961
daveredrum/Scan2Cap
[CVPR 2021] Scan2Cap: Context-aware Dense Captioning in RGB-D Scans
Language: Python 106 7 2416
OpenShapeLab/ShapeGPT
ShapeGPT: 3D Shape Generation with A Unified Multi-modal Language Model, a unified and user-friendly shape-language model
100 14 32
chenxinpeng/ARNet
CVPR 2018 - Regularizing RNNs for Caption Generation by Reconstructing The Past with The Present
Language: Python 98 3 822
ch3cook-fdu/Vote2Cap-DETR
[CVPR 2023] Vote2Cap-DETR and [T-PAMI 2024] Vote2Cap-DETR++; A set-to-set perspective towards 3D Dense Captioning; State-of-the-Art 3D Dense Captioning methods
Language: Python 90 2 238
daveredrum/D3Net
[ECCV2022] D3Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding
Language: Python 42 2 55
tanishqgautam/Image-Captioning
Implements three architectures for the image captioning problem: Merged Encoder-Decoder, Bahdanau Attention, and Transformers.
Language: Jupyter Notebook 41 3 515
damminhtien/deep-learning-image-caption-generator
Deep CNN-LSTM for Generating Image Descriptions 😈
Language: Jupyter Notebook 29 2 87
aimagelab/speaksee
PyTorch library for Visual-Semantic tasks
Language: Python 28 8 28
nalbert9/Image-Captioning
Computer Vision: Generate captions that describe the contents of images using PyTorch
Language: Jupyter Notebook 25 2 26
rahulsonone1234/Traffic-Sign-Recognition
Helps drivers identify traffic signs and supports the efficient operation of self-driving cars.
Language: Python 21 2 08
heng-hw/SpaCap3D
[IJCAI 2022] Spatiality-guided Transformer for 3D Dense Captioning on Point Clouds (official pytorch implementation)
Language: Python 20 1 85
nirajankarki5/Flickr30k-Image-Caption-Generator-Using-Deep-Learning
A deep learning model that generates descriptions of an image.
Language: Jupyter Notebook 18 1 06
aimagelab/DiCO
[BMVC 2024 Oral ✨] Revisiting Image Captioning Training Paradigm via Direct CLIP-based Optimization
Language: Python 17 2 00
abachaa/3D-MIR
3D Medical Image Retrieval in Radiology
Language: Jupyter Notebook 14 1 12
apivideo/caption.new
Sample app to add captions to an uploaded video. From api.video (https://api.video)
Language: JavaScript 11 2 0
ghostofpokemon/oCaption
oCaption: Leveraging OpenAI's GPT-4 Vision for Advanced Image Captioning
Language: Python 11 1 11
pier-maker92/stable-diffusion-experiments
This repo provides some Stable Diffusion experiments on the textual inversion task and the captioning task.
Language: Jupyter Notebook 10 1 00
pritishmishra703/Image-Captioning
Image-to-Text
Language: Python 10 1 313
juletx/image-caption-generation
Automatic Image Caption Generation model that uses a CNN to condition an LSTM-based language model
Language: Jupyter Notebook 9 1 00
ApoorvGit/god-s-eye
Aid for the blind. This AI describes the surroundings, tells the user who is in front of them (if the person is known to the AI through facial recognition), and helps them read written text (optical character recognition).
Language: Python 8 2 09
ihaeyong/drama-graph
The Drama-Graph repository produces both a knowledge base from drama scripts and a video graph for the Video Turing Test (VTT).
Language: Jupyter Notebook 8 7 81
abdullahzia510/Effecient-Urdu-Caption-Generation-using-Attention-Mechanism
This repository contains code and results for the course project of the Deep Learning Spring 2020 course offered at Information Technology University, Lahore, Pakistan. It is for learning purposes only and is not intended for commercial use.
Language: Jupyter Notebook 7 2 03
lachhabw/Image-Captioning-Extension-for-LM-Studio
LM Studio extension for automatic image captioning.
Language: Python 7 2 03
mahendranandi/Image_Captioning
Image captioning using ResNet50 and LSTM in the Keras library. An application of both CV (Computer Vision) and NLP (Natural Language Processing) concepts.
Language: Jupyter Notebook 7 1 00
LaurentVeyssier/Image-Captioning-Project-with-full-Encoder-Decoder-model
Generate captions for images using a CNN Encoder-LSTM Decoder structure
Language: Jupyter Notebook 6 2 02
oshtz/tagmeister-pc
Efficient image captioning using OpenAI API
Language: Dart 60
imanom/Generating-Subtitles
Generates subtitles from a video/audio file. Developed in Python and uses Google Cloud APIs.
Language: Jupyter Notebook 5 1 00
sabirdvd/BLIP_image_caption_demo
BLIP image caption demo - medium post blog
Language: Jupyter Notebook 5 1 04
yash-sarwaswa/Image-Caption-Generator
A Python application that generates a caption for a selected image. Uses deep learning and NLP frameworks (TensorFlow, Keras, and NLTK) for data processing and for building and evaluating deep learning models.
Language: Jupyter Notebook 5 1 01
Imiloin/Capoom
A real-time subtitle generator, based on whisper.
Language: Python 4 1 00
leeyunjai/image2text
caption generator using lavis and argostranslate
Language: Python 4 1 01
shunk031/huggingface-datasets_MSCOCO
Microsoft COCO: Common Objects in Context for huggingface datasets
Language: Python 4 2 3
Vinventive/live-captions-vr
Accessibility-focused SteamVR Overlay improving communication between deaf, hard-of-hearing, and hearing users in VR. It leverages AI to let users see real-time speech transcription in their 3D space. DISCLAIMER: Voice recognition technology is prone to errors, and this project should not be used as a replacement for a medical hearing aid.
Language: Python 4 1 10
