heatz123/naturalspeech
A fully working pytorch implementation of NaturalSpeech (Tan et al., 2022)
Python
Issues
- 1
- 0
Flip module explain
#37 opened by chnk58hoang - 0
Train loss
#36 opened by yijingshihenxiule - 0
Get duration label
#35 opened by NDNM1408 - 5
How many time cost ?
#6 opened by yiwei0730 - 0
Bug in preprocessing_duration functions
#34 opened by NDNM1408 - 0
where is inference file ??
#33 opened by saichand018 - 0
Can I train with higher sampling rate
#32 opened by NDNM1408 - 3
- 1
Duration file
#28 opened by AvichalJain - 2
- 0
- 0
Error w stft: RuntimeError: stft requires the return_complex parameter be given for real inputs, and will further require that return_complex=True in a future PyTorch release.
#29 opened by IanRDomingo - 2
- 2
Error While running Train.py
#27 opened by athenasaurav - 2
IndexError: Dimension out of range (expected to be in range of [-1, 0], but got 1)
#25 opened by athenasaurav - 0
How the models perform against VITS?
#26 opened by jupinter - 6
Inference code
#9 opened by Murats7 - 1
AssertionError
#15 opened by tandongli192 - 1
Any sample code for inference?
#24 opened by narasimha-123 - 2
- 2
- 0
Is the project dead?
#21 opened by KyotoSunshine - 0
Any pre-trained models?
#20 opened by Shiro836 - 0
Russian dataset training
#19 opened by zhuangqirong - 3
Any plan of Reproducing mix-phoneme BERT ?
#10 opened by TinaChen95 - 1
About streaming output
#14 opened by kendo6666 - 3
- 0
- 3
about cuda of dtw
#11 opened by hdmjdp - 1
French support
#7 opened by projetmbc - 1
- 1
preprocess_texts issue
#4 opened by gaga820402 - 1
About sample quality
#2 opened by rishikksh20