jackIrving

jackIrving's Stars

yuweihao/MambaOut
MambaOut: Do We Really Need Mamba for Vision?
Language:Python2k33
FeiElysia/awesome-zero-shot-captioning
A curated list of zero-shot captioning papers
182
yxuansu/MAGIC
Language Models Can See: Plugging Visual Controls in Text Generation
Language:Python25227
FeiElysia/ViECap
Transferable Decoding with Visual Entities for Zero-Shot Image Captioning, ICCV 2023
Language:Python1445
Jiaxuan-Li/EVCap
[CVPR 2024] Retrieval-Augmented Image Captioning with External Visual-Name Memory for Open-World Comprehension
Language:Python274
GT-RIPL/Xmodal-Ctx
Official PyTorch implementation of our CVPR 2022 paper: Beyond a Pre-Trained Object Detector: Cross-Modal Textual and Visual Context for Image Captioning
Language:Python6010
rickyang1114/DDP-practice
A demo of image classification with PyTorch DDP (DistributedDataParallel) and AMP (Automatic Mixed Precision) modules.
Language:Python4812
weimingboya/DFT
Language:Python10
zhouhaoyi/Informer2020
The GitHub repository for the paper "Informer" accepted by AAAI 2021.
Language:Python5.3k1.1k
luo3300612/image-captioning-DLCT
Official pytorch implementation of paper "Dual-Level Collaborative Transformer for Image Captioning" (AAAI 2021).
Language:Jupyter Notebook19431
zchoi/S2-Transformer
[IJCAI 2022] Official Pytorch code for paper “S2 Transformer for Image Captioning”
Language:Python804
husthuaan/AoANet
Code for paper "Attention on Attention for Image Captioning". ICCV 2019
Language:Python32662
yangxuntu/SGAE
Language:OpenEdge ABL22048
Vision-CAIR/VisualGPT
VisualGPT, CVPR 2022 Proceeding, GPT as a decoder for vision-language models
Language:Python31549
jacobswan1/ViTCAP
Implementation for CVPR 2022 paper " Injecting Semantic Concepts into End-to-End Image Captionin".
Language:Python411
zhangxuying1004/RSTNet
Official Code for 'RSTNet: Captioning with Adaptive Attention on Visual and Non-Visual Words' (CVPR 2021)
Language:Python11927
JDAI-CV/image-captioning
Implementation of 'X-Linear Attention Networks for Image Captioning' [CVPR 2020]
Language:Python26952
LeapLabTHU/FLatten-Transformer
Official repository of FLatten Transformer (ICCV2023)
Language:Python38022
KMnP/vpt
❄️🔥 Visual Prompt Tuning [ECCV 2022] https://arxiv.org/abs/2203.12119
Language:Python1k91
232525/PureT
Implementation of 'End-to-End Transformer Based Model for Image Captioning' [AAAI 2022]
Language:Jupyter Notebook6412
KaiyangZhou/CoOp
Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)
Language:Python1.7k191
davidnvq/grit
GRIT: Faster and Better Image-captioning Transformer (ECCV 2022)
Language:Python17827
aimagelab/PMA-Net
With a Little Help from your own Past: Prototypical Memory Networks for Image Captioning. ICCV 2023
Language:Python162
aimagelab/meshed-memory-transformer
Meshed-Memory Transformer for Image Captioning. CVPR 2020
Language:Python516136
RitaRamo/smallcap
SmallCap: Lightweight Image Captioning Prompted with Retrieval Augmentation
Language:Jupyter Notebook8917
forence/Awesome-Visual-Captioning
This repository focus on Image Captioning & Video Captioning & Seq-to-Seq Learning & NLP
41351