Voice Activity Projection Models: Self-supervised learning of Turn-taking Events
Primary LanguagePythonMIT LicenseMIT