QwenLM/Qwen-Audio
The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.
PythonNOASSERTION
Issues
- 1
Compute Requirements and Execution Time
#76 opened by Sedherthe - 0
请问有计划接入TTS模块吗
#74 opened by CD678 - 2
有onnx格式的模型吗
#56 opened by whk6688 - 1
是不支持中文提示词吗
#53 opened by GioGioBond - 5
- 1
问题请教,关于gradio的问题,我在本地部署好了,想在手机上使用,显示找不到麦克风
#28 opened by cl886699 - 1
请问Qwen-audio的训练速度,阿里官方达到多少?
#46 opened by luboxu - 1
能否获得hidden表示?
#63 opened by Kristopher-Chen - 0
在rustc 1.80.1编译tokenizers v0.13.3报错
#72 opened by martinzh717 - 9
qwen-audio处理长音频(五分钟左右)结果只输出前面20秒的文本是什么原因?
#34 opened by Wolverhampton0 - 2
可以问一下微调代码的公开的计划嘛?预计什么时候能开源呢?非常感谢!!!
#40 opened by icemoon-creative - 1
- 1
- 0
Get token in predict ?
#71 opened by CungNguyenHuy - 0
- 2
Clarification | Datasets used for training.
#65 opened by Iosifts - 0
Problems for speech translation tasks
#68 opened by ShoutaoGuo - 0
Input multiple audio file to audio encoder
#66 opened by DevKiHyun - 0
Evaluation script for VSC task seems not correct
#64 opened by mlxu995 - 1
- 0
qwen-audio和lauragpt的相关问题讨论
#62 opened by wwfcnu - 0
关于训练数据中不同语言分布情况
#61 opened by shihuai - 0
本地部署需要多少算力‘’
#60 opened by Gpwner - 1
- 0
chat模型,相同文本问题,不同音频,每次ASR返回结果都一样
#58 opened by LiXuanming - 4
use of whisper audio encoder
#33 opened by x75 - 0
how can i chat in demo
#55 opened by lzl-mt - 2
- 1
请问是否支持 VLLM 等api部署
#52 opened by su-zelong - 0
微信群满了
#49 opened by zhangfan-algo - 2
allow_pickle=False
#48 opened by Leejl0011 - 4
wechat full
#36 opened by lixf071213 - 0
支持本地api调用吗?
#47 opened by dfengpo - 2
qwen-audio 微调
#38 opened by wjfwjfwjf - 0
Few-shot Examples
#44 opened by aqibsaeed - 2
请问prompt要怎么写才能获得单个task的信息或者想要的task的信息?
#32 opened by wjyfelicity - 0
确定给的本地模型没问题吗
#41 opened by wukongbuku - 0
Infer eval_audio目录下的multi-task eval脚本,发现模型针对batch 解码性能衰减很快,请问是训练时候attention mask 或者tokenizer padding部分处理有问题吗?
#43 opened by yangjiabupt - 0
报错,requests.exceptions.HTTPError: Response details: 404 page not found, Request id: ab8a478639c847c6bbb41438e4d8606e
#42 opened by wukongbuku - 0
End of sentence id
#35 opened by marcoyang1998 - 1
关于Output Instruction的问题
#31 opened by jwang1993 - 1
哪里可以进行指令微调呢?什么时候开放有具体排期吗?
#21 opened by David19970306 - 0
是否考虑加入whisper.cpp的支持?
#30 opened by dyt06 - 0
- 0
- 2
SFT use lora? or finetune all parameters?
#24 opened by yangjiabupt - 1
The number of people in the WeChat group is full. Can you update the WeChat group QR code?
#25 opened by rookie0607 - 3
- 0
- 0
关于Clotho AQA数据集?
#19 opened by liziming5353