Pinned issues
Issues
- 0
[Feature] Ukrainian language model
#563 opened by gianpaj - 0
Help! How can I pull the voice model through the interface and implement TTS in the WeChat TTS plug-in?
#562 opened by fkucnm - 2
[BUG] Normal LoRa finetune: Error(s) in loading state_dict for TextToSemantic
#479 opened by SinanAkkoyun - 1
- 2
Generate prompt through Reference Audio "Unsupported backend 'ffmpeg' specified; ", "please select one of ['sox', 'soundfile'] instead."
#557 opened by Jonathan-Wei - 5
How to fix the voice across generations ?
#554 opened by MankaranSingh - 0
- 1
How can i train a new language? [pt-br]
#549 opened by eletroswing - 2
[BUG]The results of running the same model on different computers vary greatly.
#514 opened by tao1261060556 - 5
ModuleNotFoundError: No module named 'ormsgpack'
#535 opened by one-pip - 1
- 4
[Feature]How can I specify that characters are silent and that pinyin or English words are read automatically?
#497 opened by xeoshow - 2
- 3
Does v1.4 model respect punctuation?
#542 opened by hoveychen - 2
ModuleNotFoundError: No module named 'triton.common'
#541 opened by zn123 - 3
[BUG]The audio generated by the streaming request is noisy. How can I solve this problem?
#509 opened by ArvinC - 6
The expanded size of the tensor (472) must match the existing size (1023) at non-singleton dimension 1.
#537 opened by lafreak - 3
- 1
Mac support?
#532 opened by Pareak - 3
- 2
Crash when launching new model
#526 opened by ApatheticWrath - 6
- 0
ModuleNotFoundError: No module named 'ormsgpack'
#525 opened by one-pip - 2
Error occured while finetuning fish-speech-1.4: index 5 is out of bounds for dimension 1 with size 5
#521 opened by wjddd - 2
[BUG]ASGI error occurred during WebUI operation
#511 opened by ievenight - 2
Why fish-speech implementation of transformer feedforward contains there linears?
#512 opened by JohnHerry - 4
- 6
[Feature] Improved API interface, no need to provide reference audio and text
#495 opened by fastfading - 2
[BUG] Package name error
#491 opened by shinoairisu - 8
- 3
[BUG] centos, environment installed successfully, various dependency errors
#503 opened by wukongbuku - 4
[BUG] Unable to open WebUI
#505 opened by ejiandan - 1
Which will be better? with indices or with hiddens?
#492 opened by JohnHerry - 0
- 1
[Feature] How to support new languages
#489 opened by ILG2021 - 2
After AutoDL is deployed, the 4090 graphics card uses GPU to infer the length of 1,000 Chinese characters, which takes 37 seconds. Is the audio length of 3 minutes and 10 seconds normal?
#493 opened by wosuiyu - 6
[BUG]Unable to continue last training session
#504 opened by tao1261060556 - 2
[Feature] The current streaming TTS seems to generate speech in the order of requests. Are there any plans to infer multiple requests simultaneously in the future?
#508 opened by world1tree - 1
- 10
[BUG] 按照linux方法安装报错
#490 opened by shinoairisu - 1
llama 训练速度
#483 opened by dukGuo - 1
- 4
[BUG] model re-compiled each time, time consuming.
#501 opened by didadida-r - 1
[BUG] API接口调用报错,都是500错误 Internal Server Error
#488 opened by zane8521 - 2
[BUG] api推理不支持mp3格式的参考音频
#475 opened by Jalen-Zhong - 4
- 1
[Feature] Save streaming audio to local path
#476 opened by Jalen-Zhong - 1
【疑问】语音情感是否支持?
#480 opened by l-dawei - 0
[Feature]Tesla P40这类FP16性能稍差的显卡如何使用FP32精度进行推理?
#484 opened by SakuraRK - 6
[help wanted] faster whisper model的作用
#474 opened by Jalen-Zhong