janvainer/speedyspeech

PythonBSD-3-Clause

Issues

ZeroDivisionError when extracting duration
#17 opened 4 years ago by Charlottecuc
2
Speedyspeech loss
#1 opened 4 years ago by junkeon
4
error in train
#60 opened a year ago by herokarimpoor
0
if sample_rate is 16k, the reference hyperparamerter hop_size, win_size ..... how to set
#59 opened a year ago by jiahong3837
1
create_dataset
#58 opened a year ago by herokarimpoor
0
dataset
#57 opened a year ago by herokarimpoor
0
create_dataset
#56 opened a year ago by herokarimpoor
0
Unclear inference result
#3 opened 4 years ago by junkeon
10
question about the SSIM loss
#53 opened 2 years ago by binbinxue
0
chain of exponentials
#52 opened 2 years ago by binbinxue
1
Positional encoding
#46 opened 2 years ago by VigneshBaskar
3
Residual shape in documentation
#45 opened 2 years ago by VigneshBaskar
2
RuntimeError: stack expects a non-empty TensorList
#18 opened 4 years ago by Charlottecuc
10
pip install -r requirements.txt ERROR
#37 opened 3 years ago by brentcty-2020
1
does speedyspeech support transfer learning?
#40 opened 3 years ago by bespsm
2
optimizations
#42 opened 3 years ago by rave974
0
how much time to train it?
#41 opened 3 years ago by rave974
1
Why is there no dropout in the code?
#35 opened 3 years ago by wizardk
2
duration loss calculated in log domain or linear domain
#19 opened 3 years ago by MorganCZY
7
Duplicated padding when calculating STFT transformation.
#34 opened 3 years ago by iclementine
3
A mistake in positional encoding
#23 opened 3 years ago by iclementine
6
Extract durations from the trained model
#16 opened 3 years ago by adnan-mehremic
10
Time for training teacher model is quite slow.
#33 opened 3 years ago by ductho9799
5
wav file quality - melgan vs griffinlim
#32 opened 3 years ago by dsplog
2
clarification request - normalization used in teacher model vs student
#29 opened 3 years ago by dsplog
4
Slow performance on aarch64 + GPU compared to x86 CPU
#11 opened 3 years ago by eqikkwkp25-cyber
5
Unstable audio generation towards the end of longer sentences using interference.py
#13 opened 3 years ago by eqikkwkp25-cyber
3
Is it possible to use other vocoders?
#20 opened 3 years ago by nmichaud
7
Issues for multi_speedyspeech?
#26 opened 3 years ago by TaoTaoFu
5
trimming silences in the beginning and end of the audio
#27 opened 3 years ago by dsplog
2
Should the teacher model predict next frame?
#22 opened 3 years ago by iclementine
2
Add demo inference server
#15 opened 3 years ago by janvainer2
0
better than Tacotron2?
#4 opened 4 years ago by joan126
2
the choice of positional encoding
#10 opened 4 years ago by MorganCZY
6
running error
#12 opened 4 years ago by unwritten
1
About SSIM loss
#14 opened 4 years ago by alexdemartos
2
I have support for other language?
#9 opened 4 years ago
2
How to traini MelGAN
#8 opened 4 years ago by Moon-sung-woo
4
RuntimeError: stack expects a non-empty TensorList
#7 opened 4 years ago by Moon-sung-woo
5
mismatch commits between master code and released speedyspeech.pth
#6 opened 4 years ago by MorganCZY
4
how to remove redundant silence in the synthesized wav
#5 opened 4 years ago by MorganCZY
3
Why full connected layer replaced with conv1d？
#2 opened 4 years ago by superhg2012
2