This list includes the most state-of-the-art projects on multi-modal-embeddings, which are
- with high impact and code stability
- high number of GitHub stars
- model used in other papers' evaluations
- paper accepted in top conferences (CVPR, ICML, .etc)
Note that some of projects with more than 3 modalities may not be widely used.
- ImageBind: One Embedding Space To Bind Them All
- High-Modality Multimodal Transformer: Quantifying Modality & Interaction Heterogeneity for High-Modality Representation Learning