Adding new words to Filipino model

Question

Adding new words to Filipino model

moodpanda opened this issue 22 days ago · 8 comments

moodpanda commented 22 days ago

just wanna ask if is it possible to use my dataset it is formatted like this
audio, transcription in csv file.

thanks for answering

Answer 1 · 2024-05-13T08:15:38.000Z

up same question

Answer 2 · 2024-05-13T08:51:22.000Z

This question is too vague for me to answer. If you need help you need to provide the details. What problem are you tryign to solve, what data do you have and so on.

Answer 3 · 2024-05-13T08:59:29.000Z

I want the model to recognize new words or sentences using my own dataset. Currently, my dataset is formatted with each entry containing the path to an audio file and its corresponding transcription. I am new to Kaldi and unsure of how to properly format my data. inorder to perform model adaptation

my dataset example:
audio_file_path, transcription
data/chunk_001.wav, hello, world

Answer 4 · 2024-05-13T10:07:21.000Z

Vosk models are adapted with just text, not the audio + text.

What is the language of your dataset. What models did you try? What is the current accuracy of the model.

Answer 5 · 2024-05-13T13:20:29.000Z

my dataset language is filipino and I'm trying to use model adaptation vosk-model-tl-ph-generic-0.6 to add new words or sentence on the model vocabulary

Answer 6 · 2024-05-13T13:40:08.000Z

@moodpanda this model is precompiled and we can not modify it. You have to contact Fed directly for update, https://github.com/feddybear/flipside_ph only he can do it.

Answer 7 · 2024-05-13T13:42:04.000Z

@moodpanda this model is precompiled and we can not modify it. You have to contact Fed directly for update, https://github.com/feddybear/flipside_ph only he can do it.

got it thankyou very much I will try to email him if he still active thankyou

Answer 8 · 2024-05-13T13:46:57.000Z

He certainly can help you. Best.