/audio-visual-vad

Re-implementation of the paper "An End-to-End Multimodal Voice Activity Detection Using WaveNet Encoder and Residual Networks"

Primary LanguagePython