Reference: NVIDIA/tacotron2
- Put raw Japanese texts in ./filelists
- Put WAV files in ./wav
- (Optional) Download NVIDIA's pretrained model
- Open ./train.ipynb to install requirements and start training
- Download NVIDIA's WaveGlow model
- Open ./inference.ipynb to generate voice
File ./hparams.py line 30
何かあったらいつでも話して下さい。学院のことじゃなく、私事に関することでも何でも
nanikaacltaraitsudemohanashItekudasai.gakuiNnokotojanaku,shijinikaNsurukotodemonanidemo.
何かあったらいつでも話して下さい。学院のことじゃなく、私事に関することでも何でも
nani ka acl tara itsu demo hanashi te kudasai. gakuiN no koto ja naku, shiji nikaNsuru koto de mo naNdemo.
何かあったらいつでも話して下さい。学院のことじゃなく、私事に関することでも何でも
:na)nika a)cltara i)tsudemo ha(na)shIte ku(dasa)i.:ga(kuiNno ko(to)janaku,:shi)jini ka(Nsu)ru ko(to)demo na)nidemo.
Remember to change this line in ./inference.ipynb
sequence = np.array(text_to_sequence(text, ['japanese_cleaners']))[None, :]
- Model 1 ['japanese_cleaners']
- Model 2 ['japanese_tokenization_cleaners']
- Model 3 ['japanese_accent_cleaners']
- Model 1 ['japanese_tokenization_cleaners']