SM-Dictionaries
Base dictionaries
These dictionaries are passed as a parameter to the Montreal Forced Aligner (MFA).
Four languages are currently supported:
- French
- English
- German
- Spanish
Consonant-vowel dicts
For a given base dictionary, these dictionaries indicate which phonemes are consonants and which are vowels.
They are used by the Sentence Mixing library.
Add a new language
If you want to use a language that is supported by an MFA pretrained model but not present in SM-Dictionaries, you can follow this procedure:
- Find a text dictionary in the target language. This dictionary should be a large list of words
- Use MFA's g2p application:
- Download the corresponding g2p pretrained model
- Format your text dictionary by putting every word in lowercase, on the same line, separated by spaces
- Run the command
mfa g2p [g2p model] [dictionary] output.dict
- Put every word in output.dict in uppercase
- Create a consonant-vowel dict in which you specify which phonemes are consonants and which are vowels. Use the dicts of the supported languages in this repository as examples. Do not forget to add the line
SPACE sp
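The text-processing steps of the procedure above (formatting the word list, uppercasing the words in output.dict, and listing the phonemes that need to be classified) could be scripted roughly as follows. This is only a sketch: the function names are hypothetical and not part of this repository, and it assumes that each line of output.dict has the form `word phone1 phone2 ...`. If your MFA version writes extra columns, `collect_phonemes` would need to be adjusted.

```python
def format_word_list(words):
    """Lowercase every word and put them all on one line, separated by spaces."""
    cleaned = (w.strip().lower() for w in words)
    return " ".join(w for w in cleaned if w)


def uppercase_entries(dict_text):
    """Uppercase the word (first field) of each dictionary line.

    The phonemes and the original separator are left untouched.
    Assumes lines look like: word phone1 phone2 ...
    """
    out = []
    for line in dict_text.splitlines():
        stripped = line.strip()
        if not stripped:
            continue  # skip blank lines
        word = stripped.split(maxsplit=1)[0]
        rest = stripped[len(word):]  # separator + phonemes, unchanged
        out.append(word.upper() + rest + "\n")
    return "".join(out)


def collect_phonemes(dict_text):
    """Return the sorted list of distinct phonemes used in the dictionary.

    Useful for knowing which symbols the consonant-vowel dict must cover.
    """
    phonemes = set()
    for line in dict_text.splitlines():
        phonemes.update(line.split()[1:])  # everything after the word
    return sorted(phonemes)
```

A typical use would be to read your word list, write the result of `format_word_list` to the file given to `mfa g2p`, then read output.dict, rewrite it with `uppercase_entries`, and go through the result of `collect_phonemes` to decide which phonemes are consonants and which are vowels.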
Do not hesitate to open a pull request with your changes.