multimodal-representation-learning

There are 5 repositories under the multimodal-representation-learning topic.

TXH-mercury/VALOR
[TPAMI2024] Code and Models for VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset
Language: Python 268 11 2315
Surrey-UP-Lab/RegionSpot
Recognize Any Regions
Language: Python 120 1 154
ligerfotis/mvitac
Self-Supervised Visual-Tactile Representation Learning via Multimodal Contrastive Training
Language: Jupyter Notebook 16 1 75
holajoa/Adaptor-VL-SSL
Freeze the Backbone: A Parameter-Efficient Contrastive Approach to Robust Medical Vision-Language Pre-training. Thesis for the MSc AI degree at Imperial College London.
Language: Jupyter Notebook 3 1 11
aurooj/VLM_SS
Mini-batch selective sampling for knowledge adaption of VLMs for mammography.
Language: Jupyter Notebook 1 1 00