jackIrving's Stars
yuweihao/MambaOut
MambaOut: Do We Really Need Mamba for Vision?
FeiElysia/awesome-zero-shot-captioning
A curated list of zero-shot captioning papers
yxuansu/MAGIC
Language Models Can See: Plugging Visual Controls in Text Generation
FeiElysia/ViECap
Transferable Decoding with Visual Entities for Zero-Shot Image Captioning, ICCV 2023
Jiaxuan-Li/EVCap
[CVPR 2024] Retrieval-Augmented Image Captioning with External Visual-Name Memory for Open-World Comprehension
GT-RIPL/Xmodal-Ctx
Official PyTorch implementation of our CVPR 2022 paper: Beyond a Pre-Trained Object Detector: Cross-Modal Textual and Visual Context for Image Captioning
rickyang1114/DDP-practice
A demo of image classification with PyTorch DDP (DistributedDataParallel) and AMP (Automatic Mixed Precision) modules.
weimingboya/DFT
zhouhaoyi/Informer2020
The GitHub repository for the paper "Informer" accepted by AAAI 2021.
luo3300612/image-captioning-DLCT
Official PyTorch implementation of the paper "Dual-Level Collaborative Transformer for Image Captioning" (AAAI 2021).
zchoi/S2-Transformer
[IJCAI 2022] Official PyTorch code for the paper "S2 Transformer for Image Captioning"
husthuaan/AoANet
Code for paper "Attention on Attention for Image Captioning". ICCV 2019
yangxuntu/SGAE
Vision-CAIR/VisualGPT
VisualGPT, CVPR 2022 Proceeding, GPT as a decoder for vision-language models
jacobswan1/ViTCAP
Implementation for the CVPR 2022 paper "Injecting Semantic Concepts into End-to-End Image Captioning".
zhangxuying1004/RSTNet
Official Code for 'RSTNet: Captioning with Adaptive Attention on Visual and Non-Visual Words' (CVPR 2021)
JDAI-CV/image-captioning
Implementation of 'X-Linear Attention Networks for Image Captioning' [CVPR 2020]
LeapLabTHU/FLatten-Transformer
Official repository of FLatten Transformer (ICCV2023)
KMnP/vpt
❄️🔥 Visual Prompt Tuning [ECCV 2022] https://arxiv.org/abs/2203.12119
232525/PureT
Implementation of 'End-to-End Transformer Based Model for Image Captioning' [AAAI 2022]
KaiyangZhou/CoOp
Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)
davidnvq/grit
GRIT: Faster and Better Image-captioning Transformer (ECCV 2022)
aimagelab/PMA-Net
With a Little Help from your own Past: Prototypical Memory Networks for Image Captioning. ICCV 2023
aimagelab/meshed-memory-transformer
Meshed-Memory Transformer for Image Captioning. CVPR 2020
RitaRamo/smallcap
SmallCap: Lightweight Image Captioning Prompted with Retrieval Augmentation
forence/Awesome-Visual-Captioning
This repository focuses on Image Captioning, Video Captioning, Seq-to-Seq Learning, and NLP