as-ideas/TransformerTTS
🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech.
Python · NOASSERTION
Issues
- 0
[CONTRIBUTION] Speech Dataset Generator
#149 opened by davidmartinrius - 0
About model structure.
#145 opened by sunnnnnnnny - 1
how's the speed comparison with tacotron2?
#78 opened by lucasjinreal - 0
DeepPhonemizer? (Phonemizer License Issue)
#143 opened by fakerybakery - 0
how Normalized dataset
#142 opened by herokarimpoor - 0
Error running train_aligner.py
#141 opened by herokarimpoor - 0
Error : raise StopIteration StopIteration
#140 opened by farhadrahimiinfo - 2
Can't Finish PHONEMIZING On Google Colab.
#139 opened by The-Sad-Zewalian - 0
Alignments in PyTorch implementation
#138 opened by alexvwegen - 0
- 2
Missing HiFiGAN model.pt
#130 opened by chikiuso - 0
Pause between sentences
#131 opened by chikiuso - 0
How to install on windows?
#126 opened by VieM2k18 - 5
Other Languages - Indonesian
#111 opened by danirisdiandita - 4
Get rid of the "robotic" sound
#121 opened by alexvwegen - 0
No module named 'decorator'
#124 opened by johndpope - 0
model.hdf5 file is not created
#122 opened by AyseEe501 - 0
layer.py TransposedCNNResNorm
#118 opened by zxp54332 - 0
Word timestamps
#117 opened by tylerweitzman - 1
inference error
#114 opened by sciai-ai - 0
How can I save the audio file if I am using the pretrained model in Google Colab?
#116 opened by baniyacoder - 2
Training in 3090 takes ~30 seconds per step
#112 opened by agonzalezd - 1
ERROR: while preparing training data
#113 opened by sciai-ai - 1
The training process takes forever `python train_tts.py --config config/training_config.yaml`
#110 opened by danirisdiandita - 4
Error occurred when finalizing GeneratorDataset iterator: Failed precondition: Python interpreter state is not initialized. The process may be terminated.
#97 opened by oyeamit - 6
Errors with dependencies and Summary during aligner, duration_extraction and training
#100 opened by prajwaljpj - 2
No numbers in phonemes set and collapse of whitespaces
#105 opened by anh - 0
model cannot predict
#80 opened by sciai-ai - 1
Sequence level kD in fastspeech
#99 opened by bkumardevan07 - 1
An activation function is missing in FFNResNorm.
#102 opened by hccho2 - 1
Convergence of forward model
#101 opened by bkumardevan07 - 7
Fine-tuning HifiGAN using output mels
#92 opened by kudanai - 3
ERROR during preprocessing
#93 opened by luis-vera - 3
- 1
duration not predicted correctly
#95 opened by theAayushbajaj - 1
Regd Forcing Encoder Attention Alignments
#96 opened by bkumardevan07 - 1
Where is train_tts.py?
#94 opened by kampores - 1
Why does GPU take up so much resources?
#91 opened by gucasbrg - 1
Does it support multi-gpu training?
#88 opened by holdurhorses - 0
how to convert Chinese to speech?
#87 opened by xiakj - 7
Regarding mel start and end token
#72 opened by bkumardevan07 - 1
Query regarding Model architecture
#81 opened by bkumardevan07 - 3
About reduction_factor_schedule
#79 opened by taylorlu - 0
The declaration of the 'bucket_boundaries' variable in the data_handler.get_dataset() function is omitted, resulting in an error.
#77 opened by kampores - 1
Problems with ## Prediction
#76 opened by luis-vera - 1
- 1
All elements in a batch must have the same rank
#74 opened by pourfard - 1
Can voice control be used somehow to make generated audio sound like a particular voice?
#73 opened by shamoons - 2