huawei-noah/Speech-Backbones
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
Jupyter Notebook
Issues
- 3
- 11
- 1
Different Implementation of Diffusion Model
#35 opened by siyag12 - 1
Generated Samples are noisy
#37 opened by SandyPanda-MLDL - 1
Two questions about DiffVC
#31 opened by huangf79 - 1
mels_mode generation
#36 opened by Biyani404198 - 5
Model training question
#26 opened by Cpgrach - 1
DOESN'T WORK
#38 opened by yukiarimo - 1
A bug in model/tts.py
#28 opened by chep0k - 1
Not possible to build
#33 opened by asusdisciple - 0
GradTTS device compatibility
#34 opened by bukhalmae145 - 2
support for bigvgan
#27 opened by eschmidbauer - 4
about diffVC on Mandarin datasets
#24 opened by Theweekfoolish229 - 1
Why does the BNE-PPG-VC model in your demo perform better than the pre-trained model given in the original paper?
#20 opened by jiazj-jiazj - 2
- 2
About the prior loss and MAS algorithm
#18 opened by cantabile-kwok - 5
Multi-GPU training and expected epochs
#9 opened by bieltura - 11
Fine-tuning / Transfer Learning
#8 opened by williamluer - 4
Typo in some equations in GradTTS paper
#25 opened by cantabile-kwok - 0
- 2
How is `out_size` in `params` determined
#16 opened by cantabile-kwok - 2
Attention layer in GradTTS
#15 opened by patrickvonplaten - 1
ASR finetune ?
#11 opened by Enescigdem - 10
About end2end implementation
#12 opened by quangnh-2761 - 3
Generated outputs sound robotic in some cases!
#14 opened by aniketp02 - 1
Diffusion loss not decreasing
#13 opened by aniketp02 - 3
- 2
Grad-TTS in multispeaker setting
#5 opened by ajinkyakulkarni14 - 0
[Errno 13] Permission denied: '/home/user/app/Grad-TTS/model/monotonic_align/core.c'
#4 opened by AK391 - 2
Grad-TTS: Colab Notebook
#2 opened by AK391