multimodal-retrieval
There are 10 repositories under multimodal-retrieval topic.
adithya-s-k/VARAG
Vision-Augmented Retrieval and Generation (VARAG) - Vision first RAG Engine
naver/artemis
Official code release for ARTEMIS: Attention-based Retrieval with Text-Explicit Matching and Implicit Similarity (published at ICLR 2022)
TIBHannover/cross-modal_entity_consistency
This repository contains the dataset and source files to reproduce the results in the publication Müller-Budack et al. 2021: "Multimodal news analytics using measures of cross-modal entity and context consistency", In: International Journal on Multimedia Information Retrieval (IJMIR), Vol. 10, Art. no. 2, 2021.
vikram-mm/Multimodal-Image-Retrieval
Explores early fusion and late fusion approaches for Multimodal medical Image Retrieval
JUNJIE99/VISTA_Evaluation_FineTuning
Evaluation code and datasets for the ACL 2024 paper, VISTA: Visualized Text Embedding for Universal Multi-Modal Retrieval. The original code and model can be accessed at FlagEmbedding.
sisinflab/Formal-MultiMod-Rec
Formalizing Multimedia Recommendation through Multimodal Deep Learning, accepted in ACM Transactions on Recommender Systems.
noagarcia/context-art-retrieval
Multimodal retrieval in art with context embeddings.
marialymperaiou/knowledge-enhanced-multimodal-learning
A list of research papers on knowledge-enhanced multimodal learning
marcomoldovan/multimodal-self-distillation
A generalized self-supervised training paradigm for unimodal and multimodal alignment and fusion.
aurooj/VLM_SS
Mini-batch selective sampling for knowledge adaption of VLMs for mammography.