- 1. Introduction
- 2. Supported Methods
- 2.1 Unsupercised-cross-modal-hashing-retrieval
- 2.2 Supervised-cross-modal-hashing-retrieval
- 2.3 Unsupervised-cross-modal-real-valued
- 2.4 Supervised-cross-modal-real-valued
- 2.5 Cross-modal-Retrieval-under-Special-Retrieval-Scenario
- 3. Usage
This library is an open-source repository that contains cross-modal retrieval methods and codes.
The currently supported algorithms include:
[Click to expand]
[Click to expand]
[Click to expand]
[Click to expand]
- STMH:Semantic Topic Multimodal Hashing for Cross-Media Retrieval(IJCAI)[PDF]
[Click to expand]
- HMR:Hetero-Manifold Regularisation for Cross-Modal Hashing(TPAMI)[PDF]
- SM2H:Sparse Multi-Modal Hashing(TMM)[PDF]
-
IMH:Inter-Media Hashing for Large-scale Retrieval from Heterogeneous Data Sources(SIGMOD)[PDF]
-
LCMH:Linear Cross-Modal Hashing for Efficient Multimedia Search(MM)[PDF]
- CVH:Learning Hash Functions for Cross-View Similarity Search(IJCAI)[PDF]
[Click to expand]
- CRE:Collective Reconstructive Embeddings for Cross-Modal Hashing(TIP)[PDF]
- HMR:Hetero-Manifold Regularisation for Cross-Modal Hashing(TPAMI)[PDF]
- FS-LTE:Full-Space Local Topology Extraction for Cross-Modal Retrieval(TIP)[PDF]
- IMVH:Iterative Multi-View Hashing for Cross Media Indexing(MM)[PDF]
- PDH:Predictable Dual-View Hashing(ICML)[PDF]
[Click to expand]
[Click to expand]
- UDFCH:Unsupervised Deep Fusion Cross-modal Hashing(ICMI)[PDF]
- UDCMH:Unsupervised Deep Hashing via Binary Latent Factor Models for Large-scale Cross-modal Retrieval(IJCAI)[PDF]
- DBRC:Deep Binary Reconstruction for Cross-modal Hashing(MM)[PDF]
- DMHOR:Learning Compact Hash Codes for Multimodal Representations Using Orthogonal Deep Structure(TMM)[PDF]
[Click to expand]
- MGAH:Multi-Pathway Generative Adversarial Hashing for Unsupervised Cross-Modal Retrieval(TMM)[PDF]
[Click to expand]
- ASSPH:Adaptive Structural Similarity Preserving for Unsupervised Cross Modal Hashing(MM)[PDF]
-
AGCH:Aggregation-based Graph Convolutional Hashing for Unsupervised Cross-modal Retrieval(TMM)[PDF]
-
DGCPN:Deep Graph-neighbor Coherence Preserving Network for Unsupervised Cross-modal Hashing(AAAI)[PDF][Code]
-
DCSH:Unsupervised Deep Cross-modality Spectral Hashing(TIP)[PDF]
-
SRCH:Set and Rebase: Determining the Semantic Graph Connectivity for Unsupervised Cross-Modal Hashing(IJCAI)[PDF]
-
JDSH:Joint-modal Distribution-based Similarity Hashing for Large-scale Unsupervised Deep Cross-modal Retrieval(SIGIR)[PDF][Code]
-
DSAH:Deep Semantic-Alignment Hashing for Unsupervised Cross-Modal Retrieval(ICMR)[PDF][Code]
[Click to expand]
- DAEH:Deep Adaptively-Enhanced Hashing With Discriminative Similarity Guidance for Unsupervised Cross-Modal Retrieval(TCSVT)[PDF]
-
KDCMH:Unsupervised Deep Cross-Modal Hashing by Knowledge Distillation for Large-scale Cross-modal Retrieval(ICMR)[PDF]
-
JOG:Joint-teaching: Learning to Refine Knowledge for Resource-constrained Unsupervised Cross-modal Retrieval(MM)[PDF]
- UKD:Creating Something from Nothing: Unsupervised Knowledge Distillation for Cross-Modal Hashing(CVPR)[PDF]
[Click to expand]
[Click to expand]
[Click to expand]
- SCLCH: Joint Specifics and Consistency Hash Learning for Large-Scale Cross-Modal Retrieval(TIP) [PDF]
-
LCMFH: Label Consistent Matrix Factorization Hashing for Large-Scale Cross-Modal Similarity Search(TPAMI) [PDF]
-
TECH: A Two-Step Cross-Modal Hashing by Exploiting Label Correlations and Preserving Similarity in Both Steps(MM) [PDF]
- SCRATCH: A Scalable Discrete Matrix Factorization Hashing for Cross-Modal Retrieval(MM) [PDF]
- DCH: Learning Discriminative Binary Codes for Large-scale Cross-modal Retrieval(TIP) [PDF]
[Click to expand]
- DCDH: Discriminative Coupled Dictionary Hashing for Fast Cross-Media Retrieval(MM) [PDF]
- DLCMH: Dictionary Learning Based Hashing for Cross-Modal Retrieval(SIGIR) [PDF]
[Click to expand]
- DJSAH: Discrete Joint Semantic Alignment Hashing for Cross-Modal Image-Text Search(TCSVT) (PDF)
- FUH: Fast Unmediated Hashing for Cross-Modal Retrieval(TCSVT) (PDF)
[Click to expand]
- CSDH: Sequential Discrete Hashing for Scalable Cross-Modality Similarity Retrieval(TIP) (PDF)
- DASH: Frustratingly Easy Cross-Modal Hashing(MM) (PDF)
- QCH: Quantized Correlation Hashing for Fast Cross-Modal Search(IJCAI) (PDF)
[Click to expand]
- ASCSH: Asymmetric Supervised Consistent and Specific Hashing for Cross-Modal Retrieval(TIP) (PDF) [Code]
- SRDMH: Supervised Robust Discrete Multimodal Hashing for Cross-Media Retrieval(TMM) (PDF)
- FDCH: Fast Discrete Cross-modal Hashing With Regressing From Semantic Labels(MM) (PDF)
-
SRSH: Semi-Relaxation Supervised Hashing for Cross-Modal Retrieval(MM) (PDF) [Code]
-
RoPH: Cross-Modal Hashing via Rank-Order Preserving(TMM) (PDF) [Code]
- SRDMH: Supervised Robust Discrete Multimodal Hashing for Cross-Media Retrieval(CIKM) (PDF)
[Click to expand]
- LSRH: Linear Subspace Ranking Hashing for Cross-Modal Retrieval(TPAMI) (PDF)
-
SCM: Large-Scale Supervised Multimodal Hashing with Semantic Correlation Maximization(AAAI) (PDF)
-
HTH: Scalable Heterogeneous Translated Hashing(KDD) (PDF)
-
PLMH: Parametric Local Multimodal Hashing for Cross-View Similarity Search(IJCAI) (PDF)
-
RaHH: Comparing Apples to Oranges: A Scalable Solution with Heterogeneous Hashing(KDD) (PDF) [Code]
- CRH: Co-Regularized Hashing for Multimodal Data(CRH) (PDF)
[Click to expand]
- SDMCH: Supervised Discrete Manifold-Embedded Cross-Modal Hashing(IJCAI) (PDF)
- SePH: Semantics-Preserving Hashing for Cross-View Retrieval(CVPR) (PDF)
- MLBE: A Probabilistic Model for Multimodal Hash Function Learning(KDD) (PDF)
- CMSSH: Data Fusion through Cross-modality Metric Learning using Similarity-Sensitive Hashing(CVPR) (PDF)
[Click to expand]
[Click to expand]
- MCITR: Cross-modal Image-Text Retrieval with Multitask Learning(CIKM) (PDF)
- CAH: Correlation Autoencoder Hashing for Supervised Cross-Modal Search(ICMR) (PDF)
[Click to expand]
-
Bi-CMR: Bidirectional Reinforcement Guided Hashing for Effective Cross-Modal Retrieval(AAAI) (PDF) [Code]
-
Bi-NCMH: Deep Normalized Cross-Modal Hashing with Bi-Direction Relation Reasoning(CVPR) (PDF)
-
OTCMR: Bridging Heterogeneity Gap with Optimal Transport for Cross-modal Retrieval(CIKM) (PDF)
-
DUCMH: Deep Unified Cross-Modality Hashing by Pairwise Data Alignment(IJCAI) (PDF)
-
NRDH: Nonlinear Robust Discrete Hashing for Cross-Modal Retrieval(SIGIR) (PDF)
-
DCHUC: Deep Cross-Modal Hashing with Hashing Functions and Unified Hash Codes Jointly Learning(TKDE) (PDF) [Code]
- CHN: Correlation Hashing Network for Efficient Cross-Modal Retrieval(BMVC) (PDF)
- DVSH: Deep Visual-Semantic Hashing for Cross-Modal Retrieval(KDD) (PDF)
[Click to expand]
- MSSPQ: Multiple Semantic Structure-Preserving Quantization for Cross-Modal Retrieval(ICMR) (PDF)
-
DMFH: Deep Multiscale Fusion Hashing for Cross-Modal Retrieval(TCSVT) (PDF)
-
TEACH: Attention-Aware Deep Cross-Modal Hashing(ICMR) (PDF)
- MDCH: Mask Cross-modal Hashing Networks(TMM) (PDF)
- EGDH: Equally-Guided Discriminative Hashing for Cross-modal Retrieval(IJCAI) (PDF)
[Click to expand]
- RDCMH: Multiple Semantic Structure-Preserving Quantization for Cross-Modal Retrieval(AAAI) (PDF)
[Click to expand]
- SCAHN: Semantic Structure Enhanced Contrastive Adversarial Hash Network for Cross-media Representation Learning(MM) (PDF) [Code]
- TGCR: Multiple Semantic Structure-Preserving Quantization for Cross-Modal Retrieval(TCSVT) (PDF)
[Click to expand]
[Click to expand]
[Click to expand]
[Click to expand]
[Click to expand]
- ICCA:Towards Improving Canonical Correlation Analysis for Cross-modal Retrieval(MM) [PDF]
-
DCMIT:Deep Correlation for Matching Images and Text(CVPR) [PDF]
-
RCCA:Learning Query and Image Similarities with Ranking Canonical Correlation Analysis(ICCV) [PDF]
- MCCA:A Multi-View Embedding Space for Modeling Internet Images, Tags, and Their Semantics(IJCV) [PDF]
-
KCCA:Framing Image Description as a Ranking Task Data, Models and Evaluation Metrics(JAIR) [PDF]
- CR:Continuum Regression for Cross-modal Multimedia Retrieval(ICIP) [PDF]
[Click to expand]
- MDRF:Learning Cross-modality Similarity for Multinomial Data(ICCV) [PDF]
- tr-mmLDA:Topic Regression Multi-Modal Latent Dirichlet Allocation for Image Annotation(CVPR) [PDF]
- Corr-LDA:Modeling Annotated Data(SIGIR) [PDF]
[Click to expand]
-
Bi-CMSRM:Cross-Media Semantic Representation via Bi-directional Learning to Rank(MM) [PDF]
-
CTM:Cross-media Topic Mining on Wikipedia(MM) [PDF]
- CoCA:Dimensionality Reduction on Heterogeneous Feature Space(ICDM) [PDF]
- MCU:Maximum Covariance Unfolding: Manifold Learning for Bimodal Data(NIPS) [PDF]
- PAMIR:A Discriminative Kernel-Based Model to Rank Images from Text Queries(TPAMI) [PDF]
- CFA:Multimedia Content Processing through Cross-Modal Association(MM) [PDF]
[Click to expand]
-
CMDN:Cross-Media Shared Representation by Hierarchical Learning with Multiple Deep Networks(IJCAI) [PDF][Code]
-
MSAE:Effective deep learning-based multi-modal retrieval(VLDB) [PDF]
- Corr-AE:Cross-modal Retrieval with Correspondence Autoencoder(MM) [PDF]
- RGDBN:Latent Feature Learning in Social Media Network(MM) [PDF]
- MDBM:Multimodal Learning with Deep Boltzmann Machines(NIPS) [PDF]
[Click to expand]
[Click to expand]
-
UWML:Universal Weighting Metric Learning for Cross-Modal Retrieval (TPAMI) [PDF][Code]
-
LESS:Learning to Embed Semantic Similarity for Joint Image-Text Retrieval (TPAMI)[PDF]
-
CMCM:Cross-Modal Coherence for Text-to-Image Retrieval (AAAI) [PDF]
-
P2RM:Point to Rectangle Matching for Image Text Retrieval(MM) [PDF]
-
DPCITE:Dual-path Convolutional Image-Text Embeddings with Instance Loss(TOMM) [PDF] [code]
-
PSN:Preserving Semantic Neighborhoods for Robust Cross-Modal Retrieval(ECCV) [PDF] [Code]
- LDR:Learning Disentangled Representation for Cross-Modal Retrieval with Deep Mutual Information Estimation(MM) [PDF]
-
CRC:Cross-media Relevance Computation for Multimedia Retrieval(MM) [PDF]
-
VSE++: Improving Visual-Semantic Embeddings with Hard Negatives:(Arxiv) [PDF][Code]
-
RRF-Net:Learning a Recurrent Residual Fusion Network for Multimodal Matching(ICCV) [PDF][Code]
- DBRLM:Cross-Modal Retrieval via Deep and Bidirectional Representation Learning(TMM) [PDF]
- MSDS:Image-Text Cross-Modal Retrieval via Modality-Specific Feature Learning(ICMR) [PDF]
- DT-RNN:Grounded Compositional Semantics for Finding and Describing Images with Sentences(TACL) [PDF]
[Click to expand]
-
SMAN: Stacked Multimodal Attention Network for Cross-Modal Image-Text Retrieval(TC) [PDF]
-
CAAN:Context-Aware Attention Network for Image-Text Retrieval(CVPR) [PDF]
-
IMRAM: Iterative Matching with Recurrent Attention Memory for Cross-Modal Image-Text Retrieval(CVPR) [PDF] [Code]
-
PFAN:Position Focused Attention Network for Image-Text Matching (IJCAI) [PDF][Code]
-
CAMP: Cross-Modal Adaptive Message Passing for Text-Image Retrieval(ICCV) [PDF] [Code]
-
CMRSC:Cross-Modal Image-Text Retrieval with Semantic Consistency(MM) [PDF] [Code]
-
MCSM:Modality-specific Cross-modal Similarity Measurement with Recurrent Attention Network(TIP) [PDF][Code]
-
DSVEL:Finding beans in burgers: Deep semantic-visual embedding with localization(CVPR) [PDF][Code]
-
CRAN:Cross-media Multi-level Alignment with Relation Attention Network(IJCAI)[PDF]
-
SCAN:Stacked Cross Attention for Image-Text Matching(ECCV) [PDF] [Code]
- sm-LSTM:Instance-aware Image and Sentence Matching with Selective Multimodal LSTM(CVPR) [PDF]
[Click to expand]
-
LHSC:Learning Hierarchical Semantic Correspondences for Cross-Modal Image-Text Retrieval(ICMR) [PDF]
-
IFRFGF:Improving Fusion of Region Features and Grid Features via Two-Step Interaction for Image-Text Retrieval(MM) [PDF]
-
CODER:Coupled Diversity-Sensitive Momentum Contrastive Learning for Image-Text Retrieval(ECCV) [PDF]
-
HSGMP: Heterogeneous Scene Graph Message Passing for Cross-modal Retrieval(ICMR) [PDF]
-
WCGL:Wasserstein Coupled Graph Learning for Cross-Modal Retrieval(ICCV)[PDF]
[Click to expand]
-
DREN:Dual-Level Representation Enhancement on Characteristic and Context for Image-Text Retrieval(TCSVT) [PDF]
-
M2D-BERT:Multi-scale Multi-modal Dictionary BERT For Effective Text-image Retrieval in Multimedia Advertising(CIKM) [PDF]
-
ViSTA:ViSTA: Vision and Scene Text Aggregation for Cross-Modal Retrieval(CVPR) [PDF]
-
COTS: Collaborative Two-Stream Vision-Language Pre-Training Model for Cross-Modal Retrieval(CVPR) [PDF]
-
EI-CLIP: Entity-aware Interventional Contrastive Learning for E-commerce Cross-modal Retrieval(CVPR) [PDF]
-
SSAMT:Constructing Phrase-level Semantic Labels to Form Multi-Grained Supervision for Image-Text Retrieval(ICMR) [PDF]
-
TEAM:Token Embeddings Alignment for Cross-Modal Retrieval(MM) [PDF]
-
CAliC: Accurate and Efficient Image-Text Retrieval via Contrastive Alignment and Visual Contexts Modeling(MM) [PDF]
-
GRAN:Global Relation-Aware Attention Network for Image-Text Retrieval(ICMR) [PDF]
-
PCME:Probabilistic Embeddings for Cross-Modal Retrieval(CVPR) [PDF] [code]
- FashionBERT: Text and Image Matching with Adaptive Loss for Cross-modal Retrieval(SIGIR) [PDF]
[Click to expand]
- PCMDA:Paired Cross-Modal Data Augmentation for Fine-Grained Image-to-Text Retrieval(MM)[PDF]
-
CRGN:Deep Relation Embedding for Cross-Modal Retrieval(TIP) [PDF][Code]
-
X-MRS:Cross-Modal Retrieval and Synthesis (X-MRS): Closing the Modality Gapin Shared Representation Learning(MM) [PDF][Code]
-
LSCO:Learning Semantic Concepts and Order for Image and Sentence Matching(CVPR) [PDF]
-
TCCM:Towards Cycle-Consistent Models for Text and Image Retrieval(CVPR) [PDF]
-
GXN:Look, Imagine and Match: Improving Textual-Visual Cross-Modal Retrieval with Generative Models(CVPR) [PDF]
- 2WayNet:Linking Image and Text with 2-Way Nets(CVPR) [PDF]
- DVSA:Deep Visual-Semantic Alignments for Generating Image Descriptions(CVPR) [PDF]
[Click to expand]
[Click to expand]
[Click to expand]
- MVMLCCA: Multi-view Multi-label Canonical Correlation Analysis for Cross-modal Matching and Retrieval(CVPRW) [PDF] [Code]
- cluster-CCA: Cluster Canonical Correlation Analysis(ICAIS) [PDF]
[Click to expand]
- JDSLC: Joint Dictionary Learning and Semantic Constrained Latent Subspace Projection for Cross-Modal Retrieval(CIKM) [PDF]
- DDL: Discriminative Dictionary Learning With Common Label Alignment for Cross-Modal Retrieval(TMM) [PDF]
- CMSDL: Cross-Modality Submodular Dictionary Learning for Information Retrieval(CIKM) [PDF]
- SliM2: Supervised Coupled Dictionary Learning with Group Structures for Multi-Modal Retrieval(AAAI) [PDF]
[Click to expand]
-
MDSSL: Cross-Modal Retrieval Using Multiordered Discriminative Structured Subspace Learning(TMM) [PDF]
-
JLSLR: Joint Latent Subspace Learning and Regression for Cross-Modal Retrieval(SIGIR) [PDF]
-
JFSSL: Joint Feature Selection and Subspace Learning for Cross-Modal Retrieval(TPAIMI) [PDF] [Code]
-
MDCR: Modality-Dependent Cross-Media Retrieval(TIST) [PDF]
-
CRLC: Cross-modal Retrieval with Label Completion(MM) [PDF]
-
JGRHML: Heterogeneous Metric Learning with Joint Graph Regularization for Cross-Media Retrieval(AAAI) [PDF] [Code]
-
LCFS: Learning Coupled Feature Spaces for Cross-modal Matching(ICCV) [PDF]
- Multi-NPP: Learning Multi-View Neighborhood Preserving Projections(ICML) [PDF]
[Click to expand]
[Click to expand]
- CMOS: Online Asymmetric Metric Learning With Multi-Layer Similarity Aggregation for Cross-Modal Retrieval(TIP) [PDF]
- CMOS: Online Asymmetric Similarity Learning for Cross-Modal Retrieval(CVPR) [PDF]
-
PL-ranking: A Novel Ranking Method for Cross-Modal Retrieval(MM) [PDF]
-
RL-PLS: Cross-modal Retrieval by Real Label Partial Least Squares(MM) [PDF]
- PFAR: Parallel Field Alignment for Cross Media Retrieval(MM) [PDF]
[Click to expand]
[Click to expand]
- C3CMR: Cross-Modality Cross-Instance Contrastive Learning for Cross-Media Retrieval(MM) [PDF]
- ED-Net: Event-Driven Network for Cross-Modal Retrieval(CIKM) [PDF]
-
DSCMR: Deep Supervised Cross-modal Retrieval(CVPR) [PDF] [Code]
-
SAM: Cross-Modal Subspace Learning with Scheduled Adaptive Margin Constraints(MM) [PDF]
-
deep-SM: Cross-Modal Retrieval With CNN Visual Features: A New Baseline(TCYB) [PDF] [Code]
-
CCL: Cross-modal Correlation Learning With Multigrained Fusion by Hierarchical Network(TMM) [PDF]
-
MSFN: Cross-media Retrieval by Learning Rich Semantic Embeddings of Multimedia(MM) [PDF]
-
MNiL: Multi-Networks Joint Learning for Large-Scale Cross-Modal Retrieval(MM) [PDF] [Code]
- MDNN: Effective deep learning-based multi-modal retrieval(VLDB) [PDF]
[Click to expand]
- JFSE: Joint Feature Synthesis and Embedding: Adversarial Cross-Modal Retrieval Revisited(TPAMI) [PDF] [Code]
[Click to expand]
-
AGCN: Adversarial Graph Convolutional Network for Cross-Modal Retrieval(TCSVT) [PDF]
-
ALGCN: Adaptive Label-Aware Graph Convolutional Networks for Cross-Modal Retrieval(TMM) [PDF]
-
HGE: Cross-Modal Retrieval with Heterogeneous Graph Embedding(MM) [PDF]
-
GCR: Exploring Graph-Structured Semantics for Cross-Modal Retrieval(MM) [PDF] [Code]
-
DAGNN: Dual Adversarial Graph Neural Networks for Multi-label Cross-modal Retrieval(AAAI) [PDF]
- SSPE: Learning Semantic Structure-preserved Embeddings for Cross-modal Retrieval(MM) [PDF]
[Click to expand]
- RLCMR: Rethinking Label-Wise Cross-Modal Retrieval from A Semantic Sharing Perspective(IJCAI) [PDF]
[Click to expand]
[Click to expand]
- SSCMR:Semi-Supervised Cross-Modal Retrieval With Label Prediction(TMM) [PDF]
-
A3VSE:Annotation Efficient Cross-Modal Retrieval with Adversarial Attentive Alignment(MM) [PDF]
-
ASFS:Adaptive Semi-Supervised Feature Selection for Cross-Modal Retrieval(TMM) [PDF]
- GSS-SL:Generalized Semi-supervised and Structured Subspace Learning for Cross-Modal Retrieval(TMM) [PDF]
- SSDC:Semi-supervised Distance Consistent Cross-modal Retrieval(VSCC)[PDF]
- JRL:Learning Cross-Media Joint Representation With Sparse and Semisupervised Regularization(TCSVT) [PDF][Code]
- MVML-GL:Multiview Metric Learning with Global Consistency and Local Smoothness(TIST) [PDF]
[Click to expand]
-
SCH-GAN:Semi-Supervised Cross-Modal Hashing by Generative Adversarial Network(TC) [PDF] [Code]
-
SGCH:Semi-supervised graph convolutional hashing network for large-scale cross-modal retrieval(ICIP) [PDF]
-
SSDQ:Semi-supervised Deep Quantization for Cross-modal Search(MM) [PDF]
-
S3PH:Semi-supervised semantic-preserving hashing for efficient cross-modal retrieval(ICME) [PDF]
- AUSL:Adaptively Unified Semi-supervised Learning for Cross-Modal Retrieval(IJCAI) [PDF]
- NPH:Neighborhood-Preserving Hashing for Large-Scale Cross-Modal Search(MM) [PDF]
[Click to expand]
-
PAN: Prototype-based Adaptive Network for Robust Cross-modal Retrieval(SIGIR) [PDF]
-
MCCN: Multimodal Coordinated Clustering Network for Large-Scale Cross-modal Retrieval(MM) [PDF]
- DAVAE:Incomplete Cross-modal Retrieval with Dual-Aligned Variational Autoencoders(MM) [PDF]
[Click to expand]
-
RUCMH:Robust Unsupervised Cross-modal Hashing for Multimedia Retrieval(TOIS) [PDF]
-
ATFH-N:Adversarial Tri-Fusion Hashing Network for Imbalanced Cross-Modal Retrieval(TETCI) [PDF]
-
FlexCMH:Flexible Cross-Modal Hashing(TNNLS) [PDF]
-
TFNH:Triplet Fusion Network Hashing for Unpaired Cross-Modal Retrieval(ICMR) [PDF] [Code]
-
CALM:Collective Affinity Learning for Partial Cross-Modal Hashing(TIP) [PDF]
-
MTFH: A Matrix Tri-Factorization Hashing Framework for Efficient Cross-Modal Retrieval:(TIP) [PDF] [Code]
-
GSPH:Generalized Semantic Preserving Hashing for Cross-Modal Retrieval(TIP) [PDF]
- DAH:Dense Auto-Encoder Hashing for Robust Cross-Modality Retrieval(MM) [PDF]
[Click to expand]
-
MARS: Learning Modality-Agnostic Representation for Scalable Cross-Media Retrieval(TCSVT) [PDF]
-
CCMR:Continual learning in cross-modal retrieval(CVPR) [PDF]
-
SCML:Real-world Cross-modal Retrieval via Sequential Learning(TMM) [PDF]
- ATTL-CEL:Adaptive Temporal Triplet-loss for Cross-modal Embedding Learning(MM)[PDF]
-
SVHNs:Separated Variational Hashing Networks for Cross-Modal Retrieval(MM) [PDF]
- TempXNet:Temporal Cross-Media Retrieval with Soft-Smoothing(MM) [PDF]
[Click to expand]
-
DECL: Deep Evidential Learning with Noisy Correspondence for Cross-modal Retrieval(MM) (PDF) [Code]
-
ELRCMR: Early-Learning regularized Contrastive Learning for Cross-Modal Retrieval with Noisy Labels(MM) (PDF)
-
CMMQ: Mutual Quantization for Cross-Modal Search with Noisy Labels(CVPR) (PDF)
- WSJE: Webly Supervised Joint Embedding for Cross-Modal Image-Text Retrieval(MM) (PDF)
[Click to expand]
-
M2GUDA: Multi-Metrics Graph-Based Unsupervised Domain Adaptation for Cross-Modal Hashing(ICMR) (PDF)
-
ACP: Adaptive Cross-Modal Prototypes for Cross-Domain Visual-Language Retrieval(CVPR) (PDF)
- DASG: Unsupervised Cross-Media Retrieval Using Domain Adaptation With Scene Graph(TCSVT) (PDF)
[Click to expand]
-
LCALE: Learning Cross-Aligned Latent Embeddings for Zero-Shot Cross-Modal Retrieval(AAAI) (PDF)
-
CFSA: Correlated Features Synthesis and Alignment for Zero-shot Cross-modal Retrieval(SIGIR) (PDF)
- ZS-CMR: Learning Cross-Aligned Latent Embeddings for Zero-Shot Cross-Modal Retrieval(TIP) (PDF)
[Click to expand]
- SOCMH: Know Yourself and Know Others: Efficient Common Representation Learning for Few-shot Cross-modal Retrieval(ICMR) (PDF)
[Click to expand]
-
CMOLRS: Online Fast Adaptive Low-Rank Similarity Learning for Cross-Modal Retrieval(TMM) (PDF) [Code]
-
LEMON: Label Embedding Online Hashing for Cross-Modal Retrieval(MM) (PDF) [Code]
- OCMSR: Online Cross-Modal Scene Retrieval by Binary Representation and Semantic Graph(MM) (PDF)
- OCMH: Online cross-modal hashing for web image retrieval(AAAI) (PDF)
[Click to expand]
- Graph Model--GCR
Dataset Link:
Baidu Yun Link: https://pan.baidu.com/s/1YmW8Zz2uK3AgCs6pDEoA8A?pwd=21xh
Code: 21xh
- Unsupervised cross-modal real-valued
Dataset link:
Baidu Yun Link:https://pan.baidu.com/s/1hBNo8gBSyLbik0ka1POhiQ
Code:cc53
- Quantization--CDQ
Dataset Link:
Baidu Yun Link: https://pan.baidu.com/s/1mO1hdsJR2FN5xEAv2e7eaw?pwd=us9v
Code: us9v
- GAN--CPAH
Dataset Link:
Baidu Yun Link: https://pan.baidu.com/s/145Zool0FUb3758EeSxtHBw?pwd=mxt7
Code: mxt7
- Transformer--DCHMT
Dataset Link:
Baidu Yun Link: https://pan.baidu.com/s/1UHr2NVjFkTjLXXQ8Izy5WA?pwd=qfsj
Code: qfsj
- Feature Mapping(Sample Constraint)(Label Constraint)--MDBE
Dataset Link:
Baidu Yun Link: https://pan.baidu.com/s/15BtQ_Zz7UihZBW6KXTTodA?pwd=ir7g
Code: ir7g
- Feature Mapping(Sample Constraint)(Common Hamming)--RoPH
Dataset Link:
Baidu Yun Link: https://pan.baidu.com/s/1_uIulkuxcIcubvl5u3zsOA?pwd=46c4
Code: 46c4
- Online learning--SHDCH
Dataset Link:
Baidu Yun Link: https://pan.baidu.com/s/1-CsIJbvz3IFsmDgYk9BwYg?pwd=7hd8
Code: 7hd8
- Noise--MRL
Dataset Link:
Baidu Yun Link: https://pan.baidu.com/s/1FIrB-gXJa9VHKzLRQZf30Q?pwd=g3qt
Code: g3qt
- Online learning--LEMON
Dataset Link:
Baidu Yun Link: https://pan.baidu.com/s/1s5SnnAXo5wK7cmRs3zNq4w?pwd=jxjo
Code: jxjo
- Fine-grained--FGCrossNet
Dataset Link:
Baidu Yun Link: https://pan.baidu.com/s/1OYxCLmNKvPzwLIs5snTOlA?pwd=r80g
Code: r80g
- Noise--DECL
Dataset Link:
Baidu Yun Link: https://pan.baidu.com/s/1FcxkwOuuiUXnIl1LAatDLA?pwd=nl2z
Code: nl2z