BUTSpeechFIT/VBx

Does VBx ever use the training partition in AMI-diarization-setup?

alephpi opened this issue · 3 comments

When consulting https://github.com/BUTSpeechFIT/AMI-diarization-setup.git you proposed, I find it split into dev,test,train and I think dev is development, test is evaluation in your technical report, but what about the train? Have you ever used it in VBx (for xvector extractor training or inference)? I think no. Then what's the purpose of such a partition?

Hi,
We have not used the train set in this work. The split follows the Full-corpus-ASR partition of AMI https://groups.inf.ed.ac.uk/ami/corpus/datasets.shtml
This dataset precedes our VBx work and we wanted to be sure that there was consistency in the labels for all parts.

As for example/vad/ES2005a.lab, it was probably generated with a system VAD and that is why it does not match the oracle VAD from AMI-diarization-setup. In any case, it is only used as an example to validate that the code runs.

ok I see, thank you!