Issues
- 3
Support for Llama 3.1 model
#36 opened by sbmandava - 0
Does inference time on the bm1684x chip become much longer as seq_length increases?
#55 opened by iwantofun - 0
FTL is very high; is this normal?
#54 opened by iwantofun - 0
Are there plans to support InternVl 2-8B?
#53 opened by Li-Lai - 1
Question about the InternVL2 C++ code
#52 opened by w8501 - 0
make fails in the Llama2 demo
#51 opened by lishaung99 - 1
How to support qwen2.5-7b-instruction?
#49 opened by wuhuzi - 1
IMPORTANT: LICENSE file required
#46 opened by lazyparser - 1
We have a commercial use case; how do the individual models perform?
#44 opened by Arthassssss - 2
Qwen2 converts to onnx, but the following problem occurs when converting to bmodel
#30 opened by tzhang2014 - 2
ChatGLM3 web demo fails to run
#18 opened by S0uLHun43r - 2
Warning when exporting onnx
#16 opened by githubzjj1 - 3
Qwen1.5 1b8 and Qwen2 7b produce repetitive answers toward the end of inference
#35 opened by loredunk - 1
LLAVA-based multimodal large models are also mainstream; are there plans to support them?
#47 opened by Li-Lai - 1
When will a download link for a pre-converted glm4v bmodel be provided?
#48 opened by tang799319844 - 3
Error exporting the MiniCPM-V-2_6 ONNX model
#45 opened by thunder95 - 1
Are there plans to support qwen2-vl?
#43 opened by kong1414 - 1
FAQ Q11, [a53lite runtimellerror] get function send api error, ret2: are there any other troubleshooting approaches?
#41 opened by loredunk - 2
Is Langchain-Chatchat supported?
#40 opened by PeiwenWu - 3
Are there plans to support iFlytekSpark models?
#38 opened by tzhang2014 - 3
llama3 is not available after conversion
#13 opened by Bao0ne - 2
An overly long question causes the reply to stop halfway
#12 opened by githubzjj1 - 1
ChatGLM3-6B to onnx conversion error: torch.onnx.errors.CheckerError: The model does not have an ir_version set properly.
#11 opened by kurosakiharachan - 6
Running llama3 produces scrambled output
#29 opened by jayzou3773 - 0
chat.cpp:141: void Qwen::init(const std::vector<int>&, std::string): Assertion `true == ret' failed.
#34 opened by loredunk - 0
Standard question format: please ask your questions this way (important) (very important)
#31 opened by chuxiaoyi2023 - 1
Waiting for Qwen2 gradio web demo
#27 opened by zifeng-radxa - 1
Error exporting Qwen2-7B-Instruct to onnx
#25 opened by zifeng-radxa - 1
How do I convert a large model that is not in the list to bmodel?
#24 opened by xinyinan9527 - 17
Problems converting qwen1.5
#3 opened by yuyun2000 - 0
Llama3 pipeline output � error decode
#19 opened by zifeng-radxa - 2
Petition: please support MiniCPM-2B!
#1 opened by xiabo0816 - 2
Web client not working
#10 opened by Bao0ne - 6
Unable to run llama2-7b according to readme
#7 opened by Bao0ne - 1
GPU memory allocation failure
#8 opened by szxysdt - 1
Firmware flashing package download fails with error: No available servers found
#6 opened by shanchenjie - 5
Could you write separate tutorials for different models (especially very different ones, such as SD)?
#2 opened by raw34 - 2
Question about CV180x support
#4 opened by xpww