XIANGLIU03

XIANGLIU03's Stars

Vision-CAIR/MiniGPT-4
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
Language:Python25.5k 219 4672.9k
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
13k 256 127833
FlagOpen/FlagEmbedding
Retrieval and Retrieval-augmented LLMs
Language:Python7.9k 48 1.1k582
princeton-nlp/SimCSE
[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821
Language:Python3.5k 29 274518
rom1504/clip-retrieval
Easily compute clip embeddings and build a clip retrieval system with them
Language:Jupyter Notebook2.4k 25 233214
PKU-DAIR/RAG-Survey
Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".
246 2 015
LinWeizheDragon/Retrieval-Augmented-Visual-Question-Answering
This is the official repository for Retrieval Augmented Visual Question Answering
Language:Python189 4 4916
mbanani/lgssl
[CVPR 2023] Learning Visual Representations via Language-Guided Sampling
Language:Python146 2 59
sdc17/UPop
[ICML 2023] UPop: Unified and Progressive Pruning for Compressing Vision-Language Transformers.
Language:Python99 5 165
RitaRamo/smallcap
SmallCap: Lightweight Image Captioning Prompted with Retrieval Augmentation
Language:Jupyter Notebook98 3 1821
CrossmodalGroup/HREM
Learning Semantic Relationship among Instances for Image-Text Matching, CVPR, 2023
Language:Python90 4 108
Liuziyu77/RAR
The official implementation of RAR
Language:Python76 1 20
mesnico/TERAN
Code and Resources for the Transformer Encoder Reasoning and Alignment Network (TERAN), accepted for publication in ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM)
Language:Python73 2 612
AAA-Zheng/Image-Text-Matching-Summary
Summary of Related Research on Image-Text Matching
67 2 04
Cecile-hi/Multimodal-Learning-with-Alternating-Unimodal-Adaptation
Multimodal Learning Method MLA for CVPR 2024
Language:Python65 1 116
ppanzx/CHAN
Language:Python42 1 50
Jiaxuan-Li/EVCap
[CVPR 2024] Retrieval-Augmented Image Captioning with External Visual-Name Memory for Open-World Comprehension
Language:Python38 2 65
hhc1997/L2RM
Language:Python29 1 120
lerogo/aaai24_itr_cusa
Source code of our AAAI 2024 paper "Cross-Modal and Uni-Modal Soft-Label Alignment for Image-Text Retrieval"
Language:Python29 1 103
LCFractal/TGDT
Efficient Token-Guided Image-Text Retrieval with Consistent Multimodal Contrastive Training
Language:Python24 1 62
PKU-ICST-MIPL/MKVSE-TOMM2023
Language:Python24 2 33
zhangy0822/USER
USER: Unified Semantic Enhancement with Momentum Contrast for Image-Text Retrieval, TIP 2024
Language:Python22 2 80
LuminosityX/HAT
Implementation of our paper, 'Unifying Two-Stream Encoders with Transformers for Cross-Modal Retrieval.'
Language:Python21 2 43
ZhangXu0963/NPC
The code of the paper "Negative Pre-aware for Noisy Cross-modal Matching" in AAAI 2024.
Language:Python21 1 153
ZhangXu0963/VSL
The code of "Image-text Retrieval via Preserving Main Semantic of Vision" in ICME 2023.
Language:Python13 1 20
CapricornGuang/A3R-Cross-Modal-Large-Model-Image-Retrieval
The formal Implement in our work@CVPR2023 1st Foundation Model Challenge of Cross Modal Track
Language:Jupyter Notebook8 1 11
Paranioar/DBL
[TIP2024] The code of “Deep Boosting Learning: A Brand-new Cooperative Approach for Image-Text Matching”
Language:Python8 1 00
yic20/CoMC
[ICML2024] Official PyTorch implementation of CoMC: Language-Driven Cross-Modal Classifier for Zero-Shot Multi-Label Image Recognition
Language:Python81
wzhings/itmAFA
This repo is for the implementation of Enhancing Image-Text Matching with Adaptive Feature Aggregation, ICASSP 2024
Language:Python6 2 20
lyan62/RobustCap
Language:Python5 1 00