Does VBx ever use the training partition in AMI-diarization-setup?

Question

Does VBx ever use the training partition in AMI-diarization-setup?

alephpi opened this issue 9 months ago · 3 comments

When consulting https://github.com/BUTSpeechFIT/AMI-diarization-setup.git you proposed, I find it split into dev,test,train and I think dev is development, test is evaluation in your technical report, but what about the train? Have you ever used it in VBx (for xvector extractor training or inference)? I think no. Then what's the purpose of such a partition?

Answer 1 · 2024-03-16T09:04:09.000Z

Also I find that the VAD label in example/vad/ES2005a.lab is neither same as AMI-diarization-setup/only_words/labs/train/ES2005a.lab nor asAMI-diarization-setup/word_and_vocalsounds/labs/train/ES2005a.lab Could you explain?
https://github.com/BUTSpeechFIT/VBx/blob/master/example/vad/ES2005a.lab
https://github.com/BUTSpeechFIT/AMI-diarization-setup/blob/b158cbecae7c91298346c6bec1d6dcad473f61d3/only_words/labs/train/ES2005a.lab
https://github.com/BUTSpeechFIT/AMI-diarization-setup/blob/b158cbecae7c91298346c6bec1d6dcad473f61d3/word_and_vocalsounds/labs/train/ES2005a.lab

Answer 2 · 2024-03-18T07:11:33.000Z

Hi,
We have not used the train set in this work. The split follows the Full-corpus-ASR partition of AMI https://groups.inf.ed.ac.uk/ami/corpus/datasets.shtml
This dataset precedes our VBx work and we wanted to be sure that there was consistency in the labels for all parts.

As for example/vad/ES2005a.lab, it was probably generated with a system VAD and that is why it does not match the oracle VAD from AMI-diarization-setup. In any case, it is only used as an example to validate that the code runs.

Answer 3 · 2024-03-18T08:18:37.000Z

ok I see, thank you!