Dealing with words with multiple pronunciations

Question

Dealing with words with multiple pronunciations

ivancarapinha opened this issue 4 years ago · 2 comments

Hello,

Since this g2p transformer performs phonetic transcription word by word, how does it select the correct pronunciation for a word that has several possible pronunciations? This is very common for many nouns and verbs, for example, the noun "content" and the verb "to content" (to satisfy).

Thank you

Answer 1 · 2020-07-06T20:25:56.000Z

It supports n-best output in theory. As for using part of speech as input feature for training, it is also possible, but requires work on model architecture, and, correspondingly, code.

Answer 2 · 2021-07-01T16:35:03.000Z

Does that mean , as of now, for training the g2p model, input dictionary should only have 1-best pronunciations?
If not, how to handle multiple pronunciations in the training dictionary?