ming024/FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
PythonMIT
Issues
- 0
- 0
Self-Attention Mask Expansion Issue
#235 opened by fanoprcs - 1
CUDA out of memory
#233 opened by gaoyiyao - 0
amazing work,can it support that generate the phoneme delayed time sequence?
#234 opened by CasonTsai - 2
How to train with Indian Accent
#227 opened by Jainu-s - 2
How do I align my data
#208 opened by hkeliang - 1
how to slove indexSelectSmallIndex: block: [1,0,0], thread: [32,0,0] Assertion `srcIndex < srcSelectDimSize` failed when training ASILLE3
#220 opened by foolishcqx - 0
train.yaml
#232 opened by gaoyiyao - 0
- 0
preprocess.py
#230 opened by gaoyiyao - 0
FileNotFoundError: [Errno 2] No such file or directory: '/root/FastSpeech2-master/preprocessed_data/LJSpeech/mel/LJSpeech-mel-LJ016-0364.npy'
#229 opened by gaoyiyao - 0
- 1
Multi-language support in one sentence
#207 opened by shirubei - 4
MFA version
#225 opened by shreeshailgan - 4
Unused character embeddings?
#209 opened by g-milis - 2
Fluctuating training loss
#206 opened by 299792459b - 0
Discrepancy in the Number of Decoder Layers
#226 opened by shreeshailgan - 0
[CONTRIBUTION] Speech Dataset Generator
#224 opened by davidmartinrius - 0
ran out of input
#223 opened by ariameetgit - 1
How do you train the mfa acoustic model?
#198 opened by SandroChen - 0
not found modules
#222 opened by yumoqing - 0
Minor bug in loading vocoders
#221 opened by Mahyar24 - 4
About fine-tuning issues.
#199 opened by ltydd - 0
Duration of synthesis output is very short
#219 opened by hplanmuc - 0
The numbers of the audio samples and speakers mismatch of LibriTTS dataset
#218 opened by shiyanpei0826 - 2
ValueError: numpy.ndarray size changed, may indicate binary incompatibility. Expected 96 from C header, got 80 from PyObject
#210 opened by WJHBLUESAPPHIRE - 0
- 0
Inconvergence in pitch and energy loss
#213 opened by zhoufqing - 1
fine-tuning issue
#212 opened by zhoufqing - 0
there is an error scipy 1.5.0 while import
#211 opened by hevenangel - 1
synthesize.py LibriTTS RuntimeError: CUDA error: device-side assert triggered
#190 opened by Bingtai1015 - 1
RuntimeError: The size of tensor a (33) must match the size of tensor b (36) at non-singleton dimension 1
#204 opened by ltydd - 16
aishell3处理:使用mfa官方dict和声学模型处理aishell3
#188 opened by tuntun990606 - 0
1
#205 opened by sunnnnnnnny - 1
- 0
e_control not used during synthesis
#200 opened by lordzuko - 0
FastSpeech 2s
#197 opened by izzajalandoni - 3
model cantnot fit to data, and test voice is too bad when i use the paper configuration
#193 opened by hhm853610070 - 0
Pretained model link is invalid
#196 opened by Nueve879 - 0
Frequency of LibriTTS data. 24000 or 22050?
#195 opened by Zhongxu-Wang - 0
- 0
What should I do if I want to use phonemes and words to generate sentences at the same time?
#191 opened by tuntun990606 - 0
How about adding a discriminator to the Fastspeech2 to improve the naturalness of the spectrum?
#189 opened by Bingtai1015 - 1
synthesize chinese sentences
#178 opened by dayi1233 - 0
A custom text for inference
#187 opened by WGook - 0
The duration.npy file is picking zero as duration value for the phones with short duration in textgrids
#185 opened by nayanjha16 - 0
How would I train a TTS model on music? So instead of it talking from a prompt, it makes music from a prompt.
#183 opened by breadbrowser - 0
- 0
- 0
FastSpeechs training error
#179 opened by lunar333