Error when duration_control is <1
MaxGodTier opened this issue · 1 comments
MaxGodTier commented
I can set any value above 1 (ie. '--duration_control 1.9') to slow down the speaking rate, but can't do the opposite, anything below 1 (ie. 0.9) will throw this error message:
C:\FastPitchFormant>python synthesize.py --text "testing" --restore_step 600000 --mode single -p config/LJSpeech/preprocess.yaml -m config/LJSpeech/model.yaml -t config/LJSpeech/train.yaml --duration_control 0.9 --pitch_control 1
Removing weight norm...
Raw Text Sequence: testing
Phoneme Sequence: {T EH1 S T IH0 NG}
Traceback (most recent call last):
File "synthesize.py", line 207, in <module>
synthesize(model, args.restore_step, configs, vocoder, batchs, control_values)
File "synthesize.py", line 95, in synthesize
output = model(
File "C:\ProgramData\Anaconda3\envs\pttf2cu111py38\lib\site-packages\torch\nn\modules\module.py", line 1051, in _call_impl
return forward_call(*input, **kwargs)
File "C:\FastPitchFormant\model\FastPitchFormant.py", line 89, in forward
formant_hidden = self.formant_generator(h, mel_masks)
File "C:\ProgramData\Anaconda3\envs\pttf2cu111py38\lib\site-packages\torch\nn\modules\module.py", line 1051, in _call_impl
return forward_call(*input, **kwargs)
File "C:\FastPitchFormant\model\modules.py", line 329, in forward
output, enc_slf_attn = enc_layer(
File "C:\ProgramData\Anaconda3\envs\pttf2cu111py38\lib\site-packages\torch\nn\modules\module.py", line 1051, in _call_impl
return forward_call(*input, **kwargs)
File "C:\FastPitchFormant\model\blocks.py", line 109, in forward
enc_output, enc_slf_attn = self.slf_attn(
File "C:\ProgramData\Anaconda3\envs\pttf2cu111py38\lib\site-packages\torch\nn\modules\module.py", line 1051, in _call_impl
return forward_call(*input, **kwargs)
File "C:\FastPitchFormant\model\blocks.py", line 162, in forward
output, attn = self.attention(q, k, v, mask=mask)
File "C:\ProgramData\Anaconda3\envs\pttf2cu111py38\lib\site-packages\torch\nn\modules\module.py", line 1051, in _call_impl
return forward_call(*input, **kwargs)
File "C:\FastPitchFormant\model\blocks.py", line 189, in forward
attn = attn.masked_fill(mask, -np.inf)
RuntimeError: The size of tensor a (32) must match the size of tensor b (34) at non-singleton dimension 2
keonlee9420 commented
Hi @MaxGodTier, thank you for the report. I just fixed the code, and now it works. Try it out!