syang1993/gst-tacotron
A TensorFlow implementation of "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis"
Python
Issues
Regarding the trained model
#46 opened by hariduttt - 30
Eval from checkpoints
#4 opened by marymirzaei - 2
Training with custom data
#23 opened by wanshun123 - 0
Unable to reproduce results
#44 opened by Anchit1999 - 3
Pretrained Model
#26 opened by harismuneer - 1
Mumbling in synthesis
#45 opened by a-froghyar - 20
Style Token Layer implementation question
#1 opened by acetylSv - 4
Please update link to Blizzard data
#28 opened by simonkingedinburgh - 2
How to achieve style embedding with different weights of each token without reference audio?
#29 opened by bitwangyujia - 1
Pretrained Weights
#43 opened by ashish-roopan - 2
preprocessing the training data
#2 opened by marymirzaei - 0
What is in reference audio path?
#42 opened by Thien223 - 0
Path for Reference Audio
#38 opened by shrinidhin - 1
Error in eval.py
#39 opened by 1105060120 - 1
Check failed: dnnReLUCreateBackward_F32
#40 opened by miyoungvkim - 1
Error in datafeeder.py
#37 opened by shrinidhin - 4
Why use 'tf.layers.conv1d' for the query and key transformations instead of a fully connected layer?
#36 opened by LEEYOONHYUNG - 0
Reference Encoder Padding
#34 opened by its-sandy - 1
Training Multi-Speaker Model.
#21 opened by sujithpadar - 12
No clear speech
#32 opened by ErnstTmp - 5
GMM Attention
#31 opened by ErnstTmp - 4
tensorflow.python.framework.errors_impl.InvalidArgumentError: Incompatible shapes: [12,2262,80] vs. [12,2000,80]
#30 opened by ErnstTmp - 5
Why is there a blank stretch in the synthesized wav file when we generate with reference audio?
#24 opened by begeekmyfriend - 2
Sample Alignment Graph
#10 opened by fazlekarim - 4
Problem training as a Tacotron 1 script
#15 opened by dazenhom - 1
Poor alignment when synthesizing long sentences
#19 opened by moonnee - 4
Tone transfer
#13 opened by switchzts - 1
The model is hard to converge with LJSpeech
#18 opened by zyj008 - 3
Poor alignment with out-of-collection test data
#16 opened by butterl - 13
Eval on soft voices
#17 opened by fazlekarim - 2
multi head attention
#12 opened by Young-Sun - 2
Preprocessing blizzard 2013 data
#11 opened by jsonko - 4
training time
#8 opened by Young-Sun - 6
data feeder error
#6 opened by fazlekarim - 3
What would happen if we merged datasets?
#5 opened by fazlekarim