text-image-retrieval

There are 12 repositories under text-image-retrieval topic.

  • EasyNLP

    alibaba/EasyNLP

    EasyNLP: A Comprehensive and Easy-to-use NLP Toolkit

    Language:Python2.2k37130256
  • NVlabs/ODISE

    Official PyTorch implementation of ODISE: Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models [CVPR 2023 Highlight]

    Language:Python926384750
  • 360CVGroup/FG-CLIP

    New generation of CLIP with fine grained discrimination capability, ICML2025

    Language:Python29615
  • xiaoyuan1996/retrievalSystem

    The back-end of cross-modal retrieval system,wihch will contain services such as semantic location .etc

    Language:Python661216
  • BIGBALLON/UME-Search

    Toward Universal Multimodal Embedding

    Language:Python58
  • KimRass/CLIP

    PyTorch implementation of 'CLIP' (Radford et al., 2021) from scratch and training it on Flickr8k + Flickr30k

    Language:Python12100
  • haoxiangzhao12138/REIR

    [ACMMM'25] Referring Expression Instance Retrieval and A Strong End-to-End Baseline

  • HTAnh2003/LLM_Powered_Video_Search

    The LLM-Powered Video Search System is an advanced multimodal video search solution that leverages Large Language Models (LLMs) to enhance video retrieval through text, image, and metadata queries.

    Language:Jupyter Notebook4200
  • AIoT-Lab-BKAI/PIMA

    PIMA - A Novel Approach for Pill-Prescription Matching with GNN Assistance and Contrastive Learning

    Language:Jupyter Notebook3100
  • MayssaJaz/Text2Image-Search

    A search engine, operating on the foundation of the OpenAI Clip Model to retrieve images corresponding to textual queries.

    Language:Jupyter Notebook1100
  • Chaouki-AI/VisAlign

    VisAlign: Aligning Visual Representations with Textual Semantics for Image Similarity and Retrieval

    Language:Jupyter Notebook
  • lorenzo-stacchio/Digimon_Dataset

    Digimon Dataset for MultiModal Machine Learning

    Language:Python20