[INTERSPEECH 2022] This dataset is designed for multi-modal speaker diarization and lip-speech synchronization in the wild.
Primary LanguageHTMLOtherNOASSERTION