image-text-matching

There are 31 repositories under the image-text-matching topic.

NVlabs/GroupViT
Official PyTorch implementation of GroupViT: Semantic Segmentation Emerges from Text Supervision, CVPR 2022.
Language: Python
745 11 6553
Paranioar/Awesome_Matching_Pretraining_Transfering
The Paper List of Large Multi-Modality Model (Perception, Generation, Unification), Parameter-Efficient Finetuning, Vision-Language Pretraining, Conventional Image-Text Matching for Preliminary Insight.
411 12 547
slavabarkov/tidy
Offline semantic Text-to-Image and Image-to-Image search on Android, powered by a quantized state-of-the-art vision-language pretrained CLIP model and the ONNX Runtime inference engine
Language: Kotlin
347 8 2626
Paranioar/SGRAF
[AAAI2021] The code of “Similarity Reasoning and Filtration for Image-Text Matching”
Language: Python
213 5 1936
woodfrog/vse_infty
Code for "Learning the Best Pooling Strategy for Visual Semantic Embedding", CVPR 2021 (Oral)
Language: Python
156 4 1016
kywen1119/DSRAN
Code for journal paper "Learning Dual Semantic Relations with Graph Attention for Image-Text Matching", TCSVT, 2020.
Language: Python
72 4 1512
naver-ai/eccv-caption
Extended COCO Validation (ECCV) Caption dataset (ECCV 2022)
Language: Python
56 2 42
eric-ai-lab/ComCLIP
Official implementation and dataset for the NAACL 2024 paper "ComCLIP: Training-Free Compositional Image and Text Matching"
Language: Python
35 3 03
weiyx16/CLIP-pytorch
A non-JIT implementation / replication of OpenAI's CLIP in PyTorch
Language: Python
34 3 14
Paranioar/RCAR
[TIP2023] The code of “Plug-and-Play Regulators for Image-Text Matching”
Language: Python
29 1 33
MartinYuanNJU/SEMScene
Code implementation of paper "SEMScene: Semantic-Consistency Enhanced Multi-Level Scene Graph Matching for Image-Text Retrieval".
Language: Python
25 1 21
jaisidhsingh/CoN-CLIP
Implementation of the "Learn No to Say Yes Better" paper.
Language: Python
22 4 51
jaisidhsingh/LoRA-CLIP
Easy wrapper for inserting LoRA layers in CLIP.
Language: Python
22 2 02
alipay/PC2-NoiseofWeb
Noise of Web (NoW) is a challenging noisy correspondence learning (NCL) benchmark containing 100K image-text pairs for robust image-text matching/retrieval models.
Language: Python
12 3 01
JinhaoLee/WCA
[ICML 2024] Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Models
Language: Python
12 2 33
nhtlongcs/AIC2022-VER
Text Query based Traffic Video Event Retrieval with Global-Local Fusion Embedding
Language: Python
12 2 271
zabir-nabil/bangla-image-search
A dead-simple image search and image-text matching system for Bangla using CLIP
Language: Python
12 1 04
zabir-nabil/bangla-CLIP
CLIP (Contrastive Language–Image Pre-training) for Bangla.
Language: Python
10 2 03
cuiaiyu/Text-to-Image-ReIdentification
Unofficial code of paper "Improving description-based person re-identification by multi-granularity image-text alignment." by Niu et al. (partially implemented)
Language: Jupyter Notebook
8 2 43
kaylode/tern
Cross-modal Retrieval using Transformer Encoder Reasoning Networks (TERN). With use of Metric Learning and FAISS for fast similarity search on GPU
Language: Jupyter Notebook
8 1 01
Paranioar/DBL
[TIP2024] The code of “Deep Boosting Learning: A Brand-new Cooperative Approach for Image-Text Matching”
Language: Python
8 1 00
marialymperaiou/knowledge-enhanced-multimodal-learning
A list of research papers on knowledge-enhanced multimodal learning
7 1 00
Paranioar/GSSF
[TIP2024] The code of "GSSF: Generalized Structural Sparse Function for Deep Cross-modal Metric Learning"
5 2 00
KerimKochekov/Image-Text-Matching
BSs Graduation Project implementation [Image-Text Matching]
Language: Jupyter Notebook
4 2 00
mrzjy/GenshinCLIP
A simple open-sourced SigLIP model finetuned on Genshin Impact's image-text pairs.
3 1 11
basic-go-ahead/wikipedia-image-caption-matching
The 3rd place solution code for the Wikipedia - Image/Caption Matching Competition on Kaggle
Language: Jupyter Notebook
1 1 02
hthoai/image-text-matching
Image-Text Matching Model Zoo
Language: Python
1 2 02
Paranioar/Awesome_Image_Text_Retrieval_Benchmark
The Unified Code of Image-Text Retrieval for Further Exploration.
Language: Python
1 1 00
gaurav104/Image-Text-Matching
Language: Python
0 1 00
Cbhihe/NLP_clip-bleu-meteor
Python Implementation of lexical vector embedding similarity scoring, zero-shot classification of images and n-gram based scoring to compare textual summaries
Language: Jupyter Notebook
1 0
shayan55579/CMSL-MLP-ImageText-Matching
A novel image-text matching model using Cross-Modal Space Learning with MLP aggregation, designed to bridge the semantic gap between images and texts for improved recall and matching efficiency.
Language: Jupyter Notebook
