Geometry-Aware-Multi-Task-Learning-for-Binaural-Audio-Generation-from-Video

This project develops a multi-task framework that learns geometry-aware features for binaural audio generation. It accounts for the underlying room impulse response, the coherence of the visual stream with the positions of the sound source(s), and the consistency of the sounding objects' geometry over time.
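As a rough illustration of what a multi-task objective of this kind can look like, the sketch below combines a main binaural generation loss with three auxiliary losses as a weighted sum. This is an assumption for illustration only: the task names, the loss values, the weights, and the `combine_losses` helper are hypothetical and are not taken from this repository, which may combine its objectives differently.

```python
from typing import Mapping


def combine_losses(losses: Mapping[str, float], weights: Mapping[str, float]) -> float:
    """Return the weighted sum of per-task losses.

    Hypothetical helper: raises KeyError if any task has no weight.
    """
    missing = set(losses) - set(weights)
    if missing:
        raise KeyError(f"no weight for tasks: {sorted(missing)}")
    return sum(weights[name] * value for name, value in losses.items())


# Hypothetical per-task loss values for a single training step.
step_losses = {
    "binaural": 0.50,  # main binaural audio generation objective
    "rir": 0.20,       # room impulse response term
    "spatial": 0.10,   # coherence between visual stream and source positions
    "geometry": 0.05,  # geometric consistency of sounding objects over time
}

# Hypothetical weights; in practice these would be tuned.
task_weights = {"binaural": 1.0, "rir": 0.5, "spatial": 0.5, "geometry": 0.5}

total = combine_losses(step_losses, task_weights)
print(f"total loss: {total:.3f}")
```

In a real training loop the loss values would be tensors from a deep learning framework rather than plain floats, but the weighting pattern is the same.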

Primary Language: Python

This repository is not active.