audio-language

There are 5 repositories under the audio-language topic.

  • OFA-Sys/ONE-PEACE

    A general representation model across vision, audio, language modalities. Paper: ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities

    Language: Python
  • TXH-mercury/VAST

    Code and Model for VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset

    Language: Jupyter Notebook
  • AudioLLMs/AudioLLM

    Audio Large Language Models

  • Sreyan88/GAMA

    Code for the paper: GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities

    Language: Python
  • Sreyan88/CompA

    Code for ICLR 2024 Paper: CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models

    Language: Python