FunAudioLLM/CosyVoice
A multilingual large voice generation model with full-stack support for inference, training, and deployment.
Python · Apache-2.0
Issues
When fine-tuning CosyVoice 2.0 following run.sh, qwen_pretrain_path turns out to be empty
#748 opened by hixiaoxiong - 1
CosyVoice 2.0 seems impressive, but inference was too slow when I tried it; are there any plans for further optimization?
#739 opened by Jimmy-L99 - 0
CosyVoice 2.0 has better non-streaming performance, but inference is too slow. Can we use an inference engine?
#747 opened by Demon2958 - 2
CosyVoice 2.0 produces gibberish with this prompt audio
#740 opened by tomjamescn - 2
CosyVoice2 sometimes drops words
#746 opened by gaspire - 9
What were the experimental conditions for the 150 ms streaming latency?
#723 opened by wanghuihhh - 2
[BUG] `rtf` Greater Than 1 on V100 GPU
#730 opened by SolomonLeon - 7
CosyVoice2-0.5B does not have spk2info.pt
#729 opened by ZxnSnowy - 0
speech_tokenizer_v2.onnx is very slow in GPU inference
#742 opened by zchoi - 10
After updating, running the example code raises KeyError: 'additional_special_tokens'
#724 opened by laishujie - 4
Pretrained voices are not shown in the webui when using the CosyVoice2-0.5B model
#738 opened by Jandown - 2
Followed the steps on Gitee and it starts, but inference still fails with AttributeError: 'ConditionalDecoder' object has no attribute 'static_chunk_size'
#732 opened by 13576245149 - 6
Zero-shot inference
#718 opened by wwfcnu - 2
about cosyvoice2
#727 opened by shanhaidexiamo - 0
cosy 2.0 0.5b disappearance on hf
#733 opened by darkacorn - 0
Does zero-shot voice cloning support generating Guangxi dialects?
#716 opened by APeiZou - 0
Instruct text support
#726 opened by Hunterhuan - 1
AttributeError: 'NoneType' object has no attribute 'create_execution_context'
#720 opened by donstang - 1
transformer is not installed, please install it if you want to use related modules
#711 opened by bk111 - 1
Can SRT subtitles be produced during speech synthesis, or do they have to be recognized with Whisper after synthesis?
#712 opened by bk111 - 1
How can batched concurrent execution be implemented?
#713 opened by chenzhongzheng - 4
When run.sh reaches the inference stage, inference.py constructs the CosyVoiceModel object with one bool argument missing; why is that?
#708 opened by Nuyoah111111 - 1
Evaluation loss keeps rising during fine-tuning
#709 opened by wxzyd123 - 3
With multiple requests, GPU utilization hits 100% and the process hangs
#706 opened by xuejiale929 - 0
When run.sh reaches the inference stage, inference.py constructs the CosyVoiceModel object with one bool argument missing; why is that?
#707 opened by Nuyoah111111 - 1
webui generates voice messages all the time
#705 opened by ZxnSnowy - 1
Several lines that frequently cause endless loops
#704 opened by cksqs - 3
can i retrain the flow and llm based on a new pre-trained vocoder? or further fine tune?
#702 opened by hildazzz - 3
Streaming input and streaming output are needed
#700 opened by Hkaisense - 2
Is there any way to make the model more robust against falling into endless loops?
#701 opened by TheHonestBob - 1
Audio generation failed; what is the problem?
#703 opened by HuangBrant - 1
Will the torch inference code for the Speech Tokenizer be open-sourced?
#699 opened by Maokui-He - 1
How to stop ttsfrd from printing the "tn xxx to xxx" log
#698 opened by vra - 1
Configuration bug
#696 opened by Guanjunyun - 6
Running with python3 does nothing; running with python raises an error
#695 opened by iloveuaa - 2
How to enable logging; nothing happens on startup
#690 opened by iloveuaa - 4
Win11: after installing per the docs, launching the webUI does nothing and exits immediately
#689 opened by iloveuaa - 2
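When calling CosyVoice-300M TTS inference consecutively on segmented text, robotic artifacts get progressively worse toward the end of the audio.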
#686 opened by ZzMLvzZ-792998470 - 1
I have a 4080 GPU, so why is only the CPU used? GPU utilization stays at 0%.
#687 opened by ShawnZeng - 1
ERROR: ttsfrd-0.3.6-cp38-cp38-linux_x86_64.whl is not a supported wheel on this platform.
#685 opened by LIUKAI0815