image-text-matching

There are 31 repositories under image-text-matching topic.

  • NVlabs/GroupViT

    Official PyTorch implementation of GroupViT: Semantic Segmentation Emerges from Text Supervision, CVPR 2022.

    Language:Python745116553
  • Paranioar/Awesome_Matching_Pretraining_Transfering

    The Paper List of Large Multi-Modality Model (Perception, Generation, Unification), Parameter-Efficient Finetuning, Vision-Language Pretraining, Conventional Image-Text Matching for Preliminary Insight.

  • slavabarkov/tidy

    Offline semantic Text-to-Image and Image-to-Image search on Android powered by quantized state-of-the-art vision-language pretrained CLIP model and ONNX Runtime inference engine

    Language:Kotlin34782626
  • Paranioar/SGRAF

    [AAAI2021] The code of “Similarity Reasoning and Filtration for Image-Text Matching”

    Language:Python21351936
  • woodfrog/vse_infty

    Code for "Learning the Best Pooling Strategy for Visual Semantic Embedding", CVPR 2021 (Oral)

    Language:Python15641016
  • kywen1119/DSRAN

    Code for journal paper "Learning Dual Semantic Relations with Graph Attention for Image-Text Matching", TCSVT, 2020.

    Language:Python7241512
  • naver-ai/eccv-caption

    Extended COCO Validation (ECCV) Caption dataset (ECCV 2022)

    Language:Python56242
  • eric-ai-lab/ComCLIP

    Official implementation and dataset for the NAACL 2024 paper "ComCLIP: Training-Free Compositional Image and Text Matching"

    Language:Python35303
  • weiyx16/CLIP-pytorch

    A non-JIT version implementation / replication of CLIP of OpenAI in pytorch

    Language:Python34314
  • Paranioar/RCAR

    [TIP2023] The code of “Plug-and-Play Regulators for Image-Text Matching”

    Language:Python29133
  • MartinYuanNJU/SEMScene

    Code implementation of paper "SEMScene: Semantic-Consistency Enhanced Multi-Level Scene Graph Matching for Image-Text Retrieval".

    Language:Python25121
  • jaisidhsingh/CoN-CLIP

    Implementation of the "Learn No to Say Yes Better" paper.

    Language:Python22451
  • jaisidhsingh/LoRA-CLIP

    Easy wrapper for inserting LoRA layers in CLIP.

    Language:Python22202
  • alipay/PC2-NoiseofWeb

    Noise of Web (NoW) is a challenging noisy correspondence learning (NCL) benchmark containing 100K image-text pairs for robust image-text matching/retrieval models.

    Language:Python12301
  • JinhaoLee/WCA

    [ICML 2024] Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Models

    Language:Python12233
  • nhtlongcs/AIC2022-VER

    Text Query based Traffic Video Event Retrieval with Global-Local Fusion Embedding

    Language:Python122271
  • zabir-nabil/bangla-image-search

    A dead-simple image search and image-text matching system for Bangla using CLIP

    Language:Python12104
  • zabir-nabil/bangla-CLIP

    CLIP (Contrastive Language–Image Pre-training) for Bangla.

    Language:Python10203
  • cuiaiyu/Text-to-Image-ReIdentification

    Unofficial code of paper "Improving description-based person re-identification by multi-granularity image-text alignment." by Niu et al. (partially implemented)

    Language:Jupyter Notebook8243
  • kaylode/tern

    Cross-modal Retrieval using Transformer Encoder Reasoning Networks (TERN). With use of Metric Learning and FAISS for fast similarity search on GPU

    Language:Jupyter Notebook8101
  • Paranioar/DBL

    [TIP2024] The code of “Deep Boosting Learning: A Brand-new Cooperative Approach for Image-Text Matching”

    Language:Python8100
  • marialymperaiou/knowledge-enhanced-multimodal-learning

    A list of research papers on knowledge-enhanced multimodal learning

  • Paranioar/GSSF

    [TIP2024] The code of "GSSF: Generalized Structural Sparse Function for Deep Cross-modal Metric Learning"

  • KerimKochekov/Image-Text-Matching

    BSs Graduation Project implementation [Image-Text Matching]

    Language:Jupyter Notebook4200
  • mrzjy/GenshinCLIP

    A simple open-sourced SigLIP model finetuned on Genshin Impact's image-text pairs.

  • basic-go-ahead/wikipedia-image-caption-matching

    The 3rd place solution code for the Wikipedia - Image/Caption Matching Competition on Kaggle

    Language:Jupyter Notebook1102
  • hthoai/image-text-matching

    Image-Text Matching Model Zoo

    Language:Python1202
  • Paranioar/Awesome_Image_Text_Retrieval_Benchmark

    The Unified Code of Image-Text Retrieval for Further Exploration.

    Language:Python1100
  • Cbhihe/NLP_clip-bleu-meteor

    Python Implementation of lexical vector embedding similarity scoring, zero-shot classification of images and n-gram based scoring to compare textual summaries

    Language:Jupyter Notebook10
  • shayan55579/CMSL-MLP-ImageText-Matching

    A novel image-text matching model using Cross-Modal Space Learning with MLP aggregation, designed to bridge the semantic gap between images and texts for improved recall and matching efficiency.

    Language:Jupyter Notebook