visual-audio
There are 6 repositories under visual-audio topic.
geminate/mwave
A Music Player that can show audio waveform
MinglangQiao/MVVA-Database
Database of "Learning to Predict Salient Faces: A Novel Visual-Audio Saliency Model", ECCV 2020
MuSAELab/Multimodal-dataset-catalog
This repository lists publicly available datasets for visual-audio, speech and audio, and biomedical signal related tasks.
gusanmaz/echosight
EchoSight is a tool that helps visually impaired individuals by audibly describing images taken with a Raspberry Pi Camera or inputted via image path or URL across different operating systems.
MinglangQiao/visual_audio_saliency
Code for "Learning to Predict Salient Faces: A Novel Visual-Audio Saliency Model", ECCV 2020
mx-mark/SPMNet
Source code for "Visually aligned sound generation via sound-producing motion parsing" (Published at Neurocomputing)