pos_weight for reproducing unsupervised Buckeye & TIMIT

Question

pos_weight for reproducing unsupervised Buckeye & TIMIT

roger-tseng opened this issue a year ago · 3 comments

Hi Luke,

I've been trying to reproduce your unsupervised experiments, but I've only been able to get about 60% F1 so far, compared to your ~80% F1
May I ask what values for pos_weight you used for your unsupervised experiments with Buckeye and TIMIT?

Also, for Buckeye, do you re-split the audio recordings according to the psuedo-labels?

Answer 1 · 2023-12-30T00:30:39.000Z

pos_weight was ~1.5 for unsupervised experiments.

What do you mean by re-split the audio recordings?

Answer 2 · 2023-12-30T05:55:55.000Z

Thanks for the quick reply! I'll try it out.

About re-splitting, the Buckeye data is split into shorter segments at ground truth silence positions, which probably shouldn't be available in the unsupervised setup. So I was just wondering whether you redo the preprocessing w.r.t the pseudo-labels in some way.

Answer 3 · 2024-03-11T19:24:23.000Z

You could generate splits with https://github.com/zhenghuatan/rVAD.