jasonppy/VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
Jupyter NotebookNOASSERTION
Issues
- 2
- 1
- 7
more training details of the TTS enhanced models
#111 opened by zjlww - 0
File Not Found Error when running ./data/phonemize_encodec_encode_hf.py file
#136 opened by AroonSankoh - 0
URL's unresponsive
#134 opened by AroonSankoh - 3
- 0
New language training
#133 opened by Youssef-Barakat - 0
How to use the train/fintuned weight
#132 opened by YuXiangLo - 0
TTS enhanced model
#131 opened by WoBuChiTang - 3
Speaker Similarty
#130 opened by QajikHakobyan - 1
About masking when training TTS enhanced model
#129 opened by WoBuChiTang - 1
Questions regarding to the encodec model.
#128 opened by zmy1116 - 1
About streaming speech synthesis
#126 opened by hizening - 1
did not complete successfully: exit code: 1
#124 opened by sbmatch - 0
adapt model to the trainer API
#125 opened by not-lain - 0
Add 44100 model for huggingface
#123 opened by krokusgatan - 3
CUDA out of memory
#84 opened by WoBuChiTang - 2
Highest version of dependent libs/packages?
#108 opened by martinerk0 - 1
Colab Share Link issues and solution
#122 opened by Sewlell - 2
AssertionError: Could not resolve compression model checkpoint path: ./pretrained_models/encodec_4cb2048_giga.th
#116 opened by gjnave - 1
Discord Server for Voice Craft
#109 opened by yoesak - 5
Error when running Gradio.
#113 opened by Geisianny - 9
Where are the 330/830TTSEnhanced .pth models?
#107 opened by lukaszliniewicz - 1
The docker file is broken
#120 opened by furqan4545 - 0
Gradio app broken locally.
#118 opened by Ph0rk0z - 1
"Align" button results in an error on HF
#117 opened by ThatDocBoi - 1
Finetune dataset preparation
#106 opened by rikabi89 - 0
Model stuck in VRAM bug?
#115 opened by ThatDocBoi - 0
Could you please explain the different models, and the best one for TTS finetuning? When will the enhanced models be uploaded?
#114 opened by sthompson216 - 2
vocab size
#92 opened by WoBuChiTang - 3
HF space build is broken
#112 opened by fredvb - 2
text_vocab_size for model training
#102 opened by Lokshaw-Chau - 0
Inquiring about Chinese language support.
#110 opened by 1044690543 - 1
- 2
voicecraft gradio colab not working
#100 opened by tusharraskar - 2
How to train this model using dual GPU?
#103 opened by yoesak - 0
- 1
- 5
Gradio Colab Not working
#93 opened by Nick088Official - 1
There is "from audiocraft.solvers import CompressionSolver" in file "data/tokenizer.py", but where is "audiocraft"?
#87 opened by langlanglofa - 3
- 2
- 2
How long did the model take to train?
#88 opened by platform-kit - 0
- 2
RuntimeError: max(): Expected reduction dim to be specified for input.numel() == 0. Specify the reduction dim with the 'dim' argument.
#82 opened by rlenain - 2
training loss is nan
#81 opened by WoBuChiTang - 0
Watermarking and Content Authenticity?
#83 opened by scottslewis - 3
In Depth Install Guide?
#77 opened by Deadstarrr - 0
Validate baseline installation instructions
#80 opened by maxxrox - 1
Further train base model
#79 opened by mhenrichsen