jaywalnut310/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Python, MIT
Issues
ETAs for training?
#223 opened by IngwiePhoenix - 1
Runtime error
#221 opened by SameerSri72 - 2
Continue training VITS
#211 opened by Pipe1213 - 1
Getting IndexError: Dimension out of range (expected to be in range of [-1, 0], but got 1)
#217 opened by SameerSri72 - 0
VITS code fails to run on Python 3.10.12
#187 opened by CKAbundant - 1
Is it possible to implement training and inference in Java?
#219 opened by yuanliuzhizi - 0
License Question
#220 opened by UmerrAhsan - 2
VITS model Conversion to ONNX and TFLite.
#206 opened by tusharhchauhan - 0
Can we calculate a text's phoneme durations from StochasticDurationPredictor or DurationPredictor?
#218 opened by CasonTsai - 0
Explanation of the intersperse function
#216 opened by chnk58hoang - 1
Explanation of the Flip module
#214 opened by chnk58hoang - 3
RTX4090 training is very slow. Is there something wrong with my parameters?
#197 opened by Tsangchi-Lam - 2
Option for Single GPU?
#204 opened by hoomexsun - 0
Are "weight.data.zero_()" and "bias.data.zero_()" just examples, or are they used in the real code?
#215 opened by FMzzq - 3
Does anyone know why .spec.pt files were not generated for some of the training data?
#207 opened by jin1258804025 - 1
Getting a NumPy complex error
#199 opened by M-Usamah - 0
No audio generated
#210 opened by zhouhao27 - 1
Can't install requirements
#209 opened by zhouhao27 - 0
Multi-node support
#208 opened by Ahmed-Hossam-Aldeen - 6
Segmentation fault after training for a few steps
#181 opened by zhufeijuanjuan - 3
Transfer learning and fine-tuning tts
#191 opened by ToiYeuTien - 0
Is there a way to do batch inference?
#190 opened by SoulflareRC - 0
[CONTRIBUTION] Speech Dataset Generator
#203 opened by davidmartinrius - 1
NCCL error on Windows
#202 opened by Vubni - 0
Problems adding a new speaker
#200 opened by JoanisTriandafilidi - 0
How are the out-of-dictionary texts created?
#201 opened by deep-convai - 2
How to use trained model in inference?
#193 opened by solee0022 - 0
[Question] About improving audio quality
#198 opened by phamkhactu - 3
Duration Issue with the generated audio
#175 opened by Aliasgarsaifee - 1
Negative loss_dur
#196 opened by staceystaceystacey - 0
Fine-tune with multiple speakers' data.
#195 opened by soominjung - 1
How to locate the speaking time of each word
#192 opened by huwenkai26 - 0
Some of the losses increasing during training?
#189 opened by pouryajafarzadeh - 1
Peak Performance of Single vs. Multi-Speaker TTS Models: Seeking Insights and References
#185 opened by ikpark09 - 0
Training time too long
#184 opened by newton2149 - 1
RuntimeError: CUDA error: unknown error and torch.multiprocessing.spawn.ProcessRaisedException
#182 opened by daayaa - 1
Help needed: AttributeError: 'DistributedDataParallel' object has no attribute 'infer'
#183 opened by aaqq112 - 1
running vits on Ventura GPU
#180 opened by dnabanita7 - 0
Can I also train an Italian model?
#179 opened by Pnlvfx - 0
Is there some way to force a break?
#178 opened by vidigal - 0
Inserting new symbols in a pre-trained model
#177 opened by vidigal - 0
Issue with training at 8000Hz
#176 opened by athenasaurav