jasonppy/VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
Jupyter NotebookNOASSERTION
Issues
- 1
- 0
Any hints on successfully deploying to replicate?
#160 opened by rrothenb - 3
Working in WSL but 10min+ inference
#153 opened by holunzoo12 - 0
Expected Windows performance?
#159 opened by ajkessel - 0
transformers necessary for tts_demo to work
#158 opened by ajkessel - 0
How to troubleshoot CUDA memory issues
#157 opened by ajkessel - 0
Easy to use with gradio client
#156 opened by gosharevo - 1
I finetuned voicecraft on commonvoice-french, here are some of my findings/thoughts
#154 opened by zmy1116 - 8
use facebook's pretrained encodec model
#144 opened by thivux - 2
Total duration of training dataset
#155 opened by changjinhan - 2
Batch Inference
#143 opened by nickmitchko - 4
VoiceCraft Fine-tune dataset preparation
#138 opened by rikabi89 - 3
New language training
#133 opened by Youssef-Barakat - 0
Question regarding the how gradient accumulation is done. (It looks like we didn't /accumulation_steps when backprop loss )
#151 opened by zmy1116 - 0
omageconf error
#152 opened by zdj97 - 0
Data preperation problem
#150 opened by Me210400 - 2
Why inference TTS doesn't need to mask?
#146 opened by YuXiangLo - 0
Have you tried to not delayed stacking input (Use delayed stacking for generation, but not on input)
#149 opened by zmy1116 - 2
- 1
espeak issues on macOS Sonoma 14.2.1
#148 opened by rajpython - 0
Any one have convertted the model to onnx?
#147 opened by JuSiuYu - 2
About masking when training TTS enhanced model
#129 opened by WoBuChiTang - 0
about silence tokens during inference
#145 opened by thivux - 0
How to use the train/fintuned weight
#132 opened by YuXiangLo - 0
Hugging Face demo no longer works
#142 opened by jkyndir - 2
Simplify installation
#139 opened by madey83 - 1
espeak not installed on system even when attempting to hardcode path to espeak
#141 opened by zero-stroke - 2
- 2
The docker file is broken
#120 opened by furqan4545 - 1
File Not Found Error when running ./data/phonemize_encodec_encode_hf.py file
#136 opened by AroonSankoh - 0
nvm
#137 opened by xalteropsx - 7
more training details of the TTS enhanced models
#111 opened by zjlww - 0
URL's unresponsive
#134 opened by AroonSankoh - 0
TTS enhanced model
#131 opened by WoBuChiTang - 3
Speaker Similarty
#130 opened by QajikHakobyan - 1
Questions regarding to the encodec model.
#128 opened by zmy1116 - 1
About streaming speech synthesis
#126 opened by hizening - 1
did not complete successfully: exit code: 1
#124 opened by sbmatch - 0
adapt model to the trainer API
#125 opened by not-lain - 0
Add 44100 model for huggingface
#123 opened by krokusgatan - 1
Colab Share Link issues and solution
#122 opened by Sewlell - 2
AssertionError: Could not resolve compression model checkpoint path: ./pretrained_models/encodec_4cb2048_giga.th
#116 opened by gjnave - 1
Discord Server for Voice Craft
#109 opened by yoesak - 5
Error when running Gradio.
#113 opened by Geisianny - 0
Gradio app broken locally.
#118 opened by Ph0rk0z - 1
"Align" button results in an error on HF
#117 opened by ThatDocBoi - 0
Model stuck in VRAM bug?
#115 opened by ThatDocBoi - 0
Could you please explain the different models, and the best one for TTS finetuning? When will the enhanced models be uploaded?
#114 opened by clearpathai - 3
HF space build is broken
#112 opened by fredvb - 0
Inquiring about Chinese language support.
#110 opened by 1044690543