multimodal-representation-learning

There are 5 repositories under the multimodal-representation-learning topic.

  • TXH-mercury/VALOR

    [TPAMI2024] Codes and Models for VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset

    Language: Python
  • Surrey-UP-Lab/RegionSpot

    Recognize Any Regions

    Language: Python
  • ligerfotis/mvitac

    Self-Supervised Visual-Tactile Representation Learning via Multimodal Contrastive Training

    Language: Jupyter Notebook
  • holajoa/Adaptor-VL-SSL

    Freeze the Backbone: A Parameter-Efficient Contrastive Approach to Robust Medical Vision-Language Pre-training. Thesis of MSc AI degree at Imperial College London.

    Language: Jupyter Notebook
  • aurooj/VLM_SS

    Mini-batch selective sampling for knowledge adaption of VLMs for mammography.

    Language: Jupyter Notebook