Pinned Repositories
BFAN
Implementation of our ACMMM2019 paper, Focus Your Attention: A Bidirectional Focal Attention Network for Image-Text Matching
CMCAN
Implementation of our AAAI2022 paper, Show Your Faith: Cross-Modal Confidence-Aware Network for Image-Text Matching.
DynamicVectorQuantization
Official Pytorch Implementation of Our CVPR2023 Paper: "Towards Accurate Image Coding: Improved Autoregressive Image Generation with Dynamic Vector Quantization"
ER-SAN
Implementation of our IJCAI2022 oral paper, ER-SAN: Enhanced-Adaptive Relation Self-Attention Network for Image Captioning.
GSMN
Implementation of our CVPR2020 paper, Graph Structured Network for Image-Text Matching
HREM
Learning Semantic Relationship among Instances for Image-Text Matching, CVPR, 2023
LAPS
Linguistic-Aware Patch Slimming Framework for Fine-grained Cross-Modal Alignment, CVPR, 2024
MaskedVectorQuantization
Official Pytorch Implementation of Our CVPR2023 Paper: "Not All Image Regions Matter: Masked Vector Quantization for Autoregressive Image Generation"
NAAF
Implementation of our CVPR2022 paper, Negative-Aware Attention Framework for Image-Text Matching.
SSL-VQA
Code for our IJCAI2020 paper: Overcoming Language Priors with Self-supervised Learning for Visual Question Answering
CrossmodalGroup's Repositories
CrossmodalGroup/DynamicVectorQuantization
Official Pytorch Implementation of Our CVPR2023 Paper: "Towards Accurate Image Coding: Improved Autoregressive Image Generation with Dynamic Vector Quantization"
CrossmodalGroup/GSMN
Implementation of our CVPR2020 paper, Graph Structured Network for Image-Text Matching
CrossmodalGroup/NAAF
Implementation of our CVPR2022 paper, Negative-Aware Attention Framework for Image-Text Matching.
CrossmodalGroup/HREM
Learning Semantic Relationship among Instances for Image-Text Matching, CVPR, 2023
CrossmodalGroup/LAPS
Linguistic-Aware Patch Slimming Framework for Fine-grained Cross-Modal Alignment, CVPR, 2024
CrossmodalGroup/MaskedVectorQuantization
Official Pytorch Implementation of Our CVPR2023 Paper: "Not All Image Regions Matter: Masked Vector Quantization for Autoregressive Image Generation"
CrossmodalGroup/SSL-VQA
Code for our IJCAI2020 paper: Overcoming Language Priors with Self-supervised Learning for Visual Question Answering
CrossmodalGroup/BFAN
Implementation of our ACMMM2019 paper, Focus Your Attention: A Bidirectional Focal Attention Network for Image-Text Matching
CrossmodalGroup/CMCAN
Implementation of our AAAI2022 paper, Show Your Faith: Cross-Modal Confidence-Aware Network for Image-Text Matching.
CrossmodalGroup/ER-SAN
Implementation of our IJCAI2022 oral paper, ER-SAN: Enhanced-Adaptive Relation Self-Attention Network for Image Captioning.
CrossmodalGroup/ESL
CrossmodalGroup/CSA-Net
CrossmodalGroup/X-Dim
CrossmodalGroup/ChineseAlpacaEval
CrossmodalGroup/KNN-Instruct
[EMNLP 2024] KNN-Instruct: Automatic Instruction Construction with K Nearest Neighbor Deduction