Issues
Why does converting the original model with qwen.cpp make the file larger?
#79 opened by zzzcccxx - 2
qwen2 support
#82 opened by bil-ash - 0
How to cross-compile from x86 to ARM64?
#84 opened by yux-lab - 5
After LoRA fine-tuning a Qwen1.5 model, how do I load the fine-tuned parameters and convert the fine-tuned model?
#83 opened by Gooddz1 - 0
[BUG] How should the prompt for multi-turn dialogue be constructed?
#81 opened by 791136190 - 3
pip install -U qwen-cpp fails with an error
#58 opened by micronetboy - 2
qwen1.5 support?
#80 opened by anan1213095357 - 2
How to connect a Gradio front end to the qwen-cpp inference code?
#66 opened by tougeqaq - 8
Does it support Qwen1.5 Model?
#78 opened by kicGit - 0
After building the Python binding, how can inference be run on CPU only?
#77 opened by zzzcccxx - 7
Support `--gpu-layers`
#45 opened by lindeer - 3
Python binding error
#33 opened by xinbingzhe - 2
Python binding fails to install
#43 opened by passionate11 - 1
Python binding error: ERROR: Could not build wheels for qwen-cpp, which is required to install pyproject.toml-based projects
#52 opened by zhangzai666 - 1
Can qwen_cpp provide an API to serve a web service?
#53 opened by zhangzai666 - 2
Qwen-7B-Chat WSL GPU Error: ankerl::unordered_dense::map::at(): key not found
#29 opened by dlutsniper - 0
Why is "assistant" missing here?
#76 opened by feixyz10 - 0
How to download tiktoken_cpp
#74 opened by eswulei - 0
Add reporting of token generation speed
#73 opened by OliverQueen1466 - 0
How can a model quantized with qwen.cpp be benchmarked with optimum-benchmark? Following the README only produces a build folder, and it is unclear how to proceed with testing.
#72 opened by suyu-zhang - 0
Why does `TextStreamer` hold on punctuation?
#71 opened by Wovchena - 0
Problems using qwen.cpp on Windows
#70 opened by kingpingyue - 3
Hoping the team continues to support qwen.cpp
#60 opened by awtestergit - 0
Multi-turn conversation
#67 opened by litongjava - 4
💡 [REQUEST] - How can CPU-only qwen-cpp be wrapped as an HTTP service?
#65 opened by micronetboy - 2
💡 [Question] - With qwen-cpp on CPU only versus with CPU BLAS acceleration enabled (no GPU in either case), how large is the speed difference? In my tests there was none.
#63 opened by micronetboy - 0
💡 [Question] - Hello, a question: how can a qwen-cpp BaseStreamer be constructed from a std::string? The C++ code is missing such a constructor.
#62 opened by micronetboy - 0
Hello, a question: how can a qwen-cpp BaseStreamer be constructed from a std::string? The C++ code is missing such a constructor.
#61 opened by micronetboy - 1
Why is there a large performance gap for qwen.cpp between A100 and A10?
#56 opened by zhangzai666 - 0
How can the Python binding enable BLAS CPU acceleration?
#59 opened by micronetboy - 1
Python binding fails to compile on Windows
#37 opened by AppleJunJiang - 0
CUDA error 2 at /home/qwen.cpp/third_party/ggml/src/ggml-cuda.cu:7196: out of memory
#55 opened by youngallien - 9
How much memory does quantizing the 72B model need? Even with 192 GB the process gets killed
#47 opened by sweetcard - 0
How much memory does quantizing the 7B model need? I keep getting out of memory
#51 opened by WCSY-YG - 0
Question about ctx_w_size in the code
#49 opened by EveningLin - 5
Support for AMD's ROCm
#46 opened by riverzhou - 6
GGML_ASSERT when using a long prompt
#44 opened by Ayahuasec - 1
Qwen-7B-Q4_0 works well on Mac M1, but Qwen-7B-Q8_0 fails with a ggml-metal error.
#42 opened by songkq - 1
pip install of qwen_cpp fails on 64-bit Linux — unsupported?
#32 opened by qianliyx - 0
Inference results of qwen.cpp for Qwen-14b-chat differ from Qwen-14b-chat running on CUDA
#30 opened by wertyac - 0
Can you add an additional function to let convert.py support Qwen/Qwen-7B-Chat-Int4?
#28 opened by x1ngzai