visual-audio

There are 6 repositories under visual-audio topic.

  • geminate/mwave

    A Music Player that can show audio waveform

    Language:JavaScript673315
  • MinglangQiao/MVVA-Database

    Database of "Learning to Predict Salient Faces: A Novel Visual-Audio Saliency Model", ECCV 2020

    Language:Python10221
  • MuSAELab/Multimodal-dataset-catalog

    This repository lists publicly available datasets for visual-audio, speech and audio, and biomedical signal related tasks.

  • gusanmaz/echosight

    EchoSight is a tool that helps visually impaired individuals by audibly describing images taken with a Raspberry Pi Camera or inputted via image path or URL across different operating systems.

    Language:Python2200
  • MinglangQiao/visual_audio_saliency

    Code for "Learning to Predict Salient Faces: A Novel Visual-Audio Saliency Model", ECCV 2020

    Language:Python2211
  • mx-mark/SPMNet

    Source code for "Visually aligned sound generation via sound-producing motion parsing" (Published at Neurocomputing)