sberdevices/golos

Speaker labels

maksim-pankov opened this issue · 1 comments

Is it possible to add speaker Ids (depersonalized) of any kind to dataset jsonl files? I need them to distinguish utterances from different speakers to train the voice embedding model (for speaker identification and speaker diarization tasks).

Sadly we don't have such information for this dataset