jtkim-kaist/VAD

ACAM always detects poorly at the start of a corpus

lyapple2008 opened this issue · 2 comments

As the title says, I found that the beginning of the corpus is always detected as non-speech. Can you explain why?

Hi, is there any silence at the beginning of your sample? If not, the result may be poor: ACAM is a context-based model, so it needs some preceding samples to capture the speech context. Please send your sample to jtkim@kaist.ac.kr and I'll debug it for you.
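If the recording starts directly with speech, one possible workaround is to prepend a short stretch of silence before running the detector and then drop the frame labels that cover the padded region. The following is only a minimal sketch of that idea, not code from this repository: the function names, the 0.5 s pad length, and the 160-sample hop (10 ms at 16 kHz) are all assumptions for illustration, and whether padding actually improves ACAM's output on a given file is untested here.

```python
import numpy as np


def pad_leading_silence(samples, sample_rate, pad_seconds=0.5):
    """Prepend pad_seconds of zeros (hypothetical helper, not part of the VAD repo).

    Returns the padded signal and the number of samples that were added.
    """
    n_pad = int(round(pad_seconds * sample_rate))
    padding = np.zeros(n_pad, dtype=samples.dtype)
    return np.concatenate([padding, samples]), n_pad


def trim_leading_labels(labels, n_pad, hop_length):
    """Drop the per-frame labels that fall inside the padded region.

    hop_length is the frame shift in samples; its value here is an assumption
    and should be matched to the detector's real frame shift.
    """
    n_frames = n_pad // hop_length
    return labels[n_frames:]


# Usage with synthetic data: 1 s of "audio" at 16 kHz.
audio = np.ones(16000, dtype=np.float32)
padded, n_pad = pad_leading_silence(audio, sample_rate=16000, pad_seconds=0.5)

# Stand-in for the detector's per-frame output on the padded signal.
fake_labels = np.arange(len(padded) // 160)
aligned = trim_leading_labels(fake_labels, n_pad, hop_length=160)
```

Digital zeros are not the same as real room noise, so padding with a short segment of recorded background noise may behave differently from the all-zero padding used here.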

Thank you for your reply. I have sent the test audio to your email.