Pinned Repositories
CARE
(TIP'2023) Concept-Aware Video Captioning: Describing Videos with Effective Prior Information
CLFM
(AAAI'2024) Embracing Language Inclusivity and Diversity in CLIP Through Continual Language Learning
CLIP
Contrastive Language-Image Pretraining
CLIP-Captioner
(PRCV'2022) CLIP Meets Video Captioning: Concept-Aware Representation Learning Does Matter
MLLM-MRG
Customizing General-Purpose Foundation Models for Medical Report Generation
MultiCapCLIP
(ACL'2023) MultiCapCLIP: Auto-Encoding Prompts for Zero-Shot Multilingual Visual Captioning
Non-Autoregressive-Video-Captioning
The PyTorch code of the AAAI2021 paper "Non-Autoregressive Coarse-to-Fine Video Captioning".
standard-readme
A standard style for README files
video-classification-3d-cnn
ZeroNLG
(TPAMI'2024) ZeroNLG: Aligning and Autoencoding Domains for Zero-Shot Multimodal and Multilingual Natural Language Generation
yangbang18's Repositories
yangbang18/Non-Autoregressive-Video-Captioning
The PyTorch code of the AAAI2021 paper "Non-Autoregressive Coarse-to-Fine Video Captioning".
yangbang18/MultiCapCLIP
(ACL'2023) MultiCapCLIP: Auto-Encoding Prompts for Zero-Shot Multilingual Visual Captioning
yangbang18/CARE
(TIP'2023) Concept-Aware Video Captioning: Describing Videos with Effective Prior Information
yangbang18/ZeroNLG
(TPAMI'2024) ZeroNLG: Aligning and Autoencoding Domains for Zero-Shot Multimodal and Multilingual Natural Language Generation
yangbang18/CLIP-Captioner
(PRCV'2022) CLIP Meets Video Captioning: Concept-Aware Representation Learning Does Matter
yangbang18/video-classification-3d-cnn
yangbang18/MLLM-MRG
Customizing General-Purpose Foundation Models for Medical Report Generation
yangbang18/CLFM
(AAAI'2024) Embracing Language Inclusivity and Diversity in CLIP Through Continual Language Learning
yangbang18/CLIP
Contrastive Language-Image Pretraining
yangbang18/standard-readme
A standard style for README files
yangbang18/vggish
yangbang18/Video-Swin-Transformer
yangbang18/yangbang18
Config files for my GitHub profile.
yangbang18/ZeroCap
Implementation of Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic