multimodal-representation-learning
There are 5 repositories under the multimodal-representation-learning topic.
TXH-mercury/VALOR
[TPAMI2024] Codes and Models for VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset
Surrey-UP-Lab/RegionSpot
Recognize Any Regions
ligerfotis/mvitac
Self-Supervised Visual-Tactile Representation Learning via Multimodal Contrastive Training
holajoa/Adaptor-VL-SSL
Freeze the Backbone: A Parameter-Efficient Contrastive Approach to Robust Medical Vision-Language Pre-training. Thesis for the MSc AI degree at Imperial College London.
aurooj/VLM_SS
Mini-batch selective sampling for knowledge adaption of VLMs for mammography.