VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time
This project implements a framework for generating lifelike talking faces with visual affective skills given a single static image and a speech audio clip.
pip install -r requirements.txt
pip install .