XIANGLIU03

XIANGLIU03's Stars

scvready123/IterWeGO
This is the implementation of our paper, "Leveraging Weak Cross-Modal Guidance for Coherence Modelling via Iterative Learning".
Language:Python3
mlfoundations/open_flamingo
An open-source framework for training large multimodal models.
Language:Python3.8k287
liyongqi67/GRACE
Language:Python141
FlyCuteBird/MKTLON
The source code of MKTLON
Language:Python3
microsoft/BridgeTower
Open source code for AAAI 2023 Paper "BridgeTower: Building Bridges Between Encoders in Vision-Language Representation Learning"
Language:Python1596
cluel01/clip-branches
Language:Python62
96-Zachary/vse_2ad
Language:Python163
AAA-Zheng/Listwise_ITR
Official PyTorch implementation of the paper "Integrating Listwise Ranking into Pairwise-based Image-Text Retrieval"
Language:Python8
AAA-Zheng/LG_ITM
Official PyTorch implementation of the paper "Integrating Language Guidance into Image-Text Matching for Correcting False Negatives"
Language:Python61
facebookresearch/flip
Official Open Source code for "Scaling Language-Image Pre-training via Masking"
Language:Python40815
Mario0716/SCCMR-master
Soft Contrastive Cross-Modal Retrieval(Pytorch Code)
Language:Jupyter Notebook4
vkhoi/cora_cvpr24
Language:Python21
HuiChen24/IMRAM
code for our CVPR2020 paper "IMRAM: Iterative Matching with Recurrent Attention Memory for Cross-Modal Image-Text Retrieval"
Language:Python9329
RustamyF/clip-multimodal-ml
Language:Jupyter Notebook536
McGill-NLP/diffusion-itm
Code and data setup for the paper "Are Diffusion Models Vision-and-language Reasoners?"
Language:Python311
mesnico/ALADIN
Official implementation of the paper "ALADIN: Distilling Fine-grained Alignment Scores for Efficient Image-Text Matching and Retrieval"
Language:Python225
Zjamie813/SelfAlign
Language:Python9
winycg/CLIP-KD
[CVPR-2024] Official implementations of CLIP-KD: An Empirical Study of CLIP Model Distillation
Language:Python822
winycg/MCL
[AAAI-2022 Oral] Official implementations of MCL: Mutual Contrastive Learning for Visual Representation Learning
Language:Python724
Paranioar/SGRAF
[AAAI2021] The code of “Similarity Reasoning and Filtration for Image-Text Matching”
Language:Python21336
openai/CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Language:Jupyter Notebook26.4k3.4k
Yuting-Gao/PyramidCLIP
Implementation of PyramidCLIP(NeurIPS2022).
Language:Python292
yzhuoning/Awesome-CLIP
Awesome list for research on CLIP (Contrastive Language-Image Pre-Training).
1.1k56
StanfordMIMI/villa
ViLLA: Fine-grained vision-language representation learning from real-world data
Language:Python401
Wangt-CN/Code_CASC
Language:Python139
BruceW91/CVSE
The official source code for the paper Consensus-Aware Visual-Semantic Embedding for Image-Text Matching (ECCV 2020)
Language:Python17219
liuyyy111/ConVSE
PyTorch source code for "Regularizing Visual Semantic Embedding with Contrastive Learning for Image-Text Matching"
Language:Python51
CrossmodalGroup/CMCAN
Implementation of our AAAI2022 paper, Show Your Faith: Cross-Modal Confidence-Aware Network for Image-Text Matching.
Language:Python364
CrossmodalGroup/ESL
Language:Python122
zengyan-97/X2-VLM
All-In-One VLM: Image + Video + Transfer to Other Languages / Domains (TPAMI 2023)
Language:Python14613