cmusphinx/g2p-seq2seq

Dealing with words with multiple pronunciations

ivancarapinha opened this issue · 2 comments

Hello,

Since this g2p transformer performs phonetic transcription word by word, how does it select the correct pronunciation for a word that has several possible pronunciations? This is very common for many nouns and verbs, for example, the noun "content" and the verb "to content" (to satisfy).

Thank you

It supports n-best output in theory. As for using part of speech as input feature for training, it is also possible, but requires work on model architecture, and, correspondingly, code.

Does that mean , as of now, for training the g2p model, input dictionary should only have 1-best pronunciations?
If not, how to handle multiple pronunciations in the training dictionary?