QwenLM/Qwen2.5-Coder
Qwen2.5-Coder is the code version of Qwen2.5, the large language model series developed by Qwen team, Alibaba Cloud.
Python
Issues
- 5
Error encountered while training qwen-2.5-3b model using Qwen2.5-Coder/finetuning/sft/train.py
#171 opened by Yhw109 - 7
Train a model for a new language
#200 opened by boyu9 - 8
关于 CrossCodeEval 测试
#181 opened by yfzhou3993 - 5
Aider benchmark, DeepSeek-6.7B-Instruct model hardly generates SEARCH/REPLACE blocks, leading to very low pass rates
#192 opened by ytxmobile98 - 1
las tic en la region san martin
#201 opened by heidi149 - 5
Does qwen2.5-coder support function calling?
#180 opened by Muuut - 2
adding documents (docx, md, pdf) into qwen2.5-coder
#203 opened by ozgecinko - 3
[Bug]: Qwen-Coder结束符是应该使用<endoftext>还是<im_end>?
#182 opened by m-maoyanyu - 4
- 1
Question about the trianing dataset
#190 opened by Hearum - 2
- 1
Error: cannot find tensor lm_head.weight
#173 opened by malikwirin - 1
torch.OutOfMemoryError: CUDA out of memory
#174 opened by old-kai - 2
the chat template `qwen2_5` corresponding to the model `qwen2_5-coder-32b-instruct-awq` is in chat format. Please use the `chat.completions` API.
#178 opened by gitYuZui - 4
- 3
Why do I have a lot of `code>` in generated Java code? What should I do to get rid of them?
#142 opened by ytxmobile98 - 2
tokenizer.json changed after ms-swift sft
#176 opened by oasis-0927 - 2
How can I use Qwen2.5-Coder 32B in Cursor?
#159 opened by IamTirion - 1
How synthetic data were generated?
#147 opened by wasiahmad - 2
- 2
Error in evaluation on Qwen2.5-Coder
#172 opened by Zephyreeze - 10
- 2
the model continuously outputs repeated tokens
#163 opened by Yhw109 - 1
我明明指示的是翻译,却私自给我增加莫名其妙的内容
#170 opened by pio57019 - 3
max_new_token max value for Qwen2.5-Coder Instruct?
#165 opened by Thireus - 1
Dataset & Reproducible Experiment
#167 opened by fblgit - 3
希望跟上o1的步伐, 达到强逻辑推理的层次.
#158 opened by qwas982 - 1
这是已经训练好可以用的模型还是训练模型的代码呢?
#164 opened by lionel-daydayup - 6
- 2
Cursor edit + Qwen2.5-Coder prompt?
#156 opened by Owen-Qin - 4
32B model
#135 opened by caiduoduo12138 - 1
Qwen2.5-Coder-7B-Instruct 的 BenchMark 结果
#155 opened by FearfulTomcat27 - 2
KeyError: 'qwen2' when running example code on Windows WSL Ubuntu. Successfully installed requirements.txt
#151 opened by andytriboletti - 1
I periodically encounter infinite generations
#152 opened by Swipe4057 - 1
Url endpoint for API Key of qwen (Alibabacloud)
#145 opened by erik445445 - 1
repo级别预训练数据构造
#140 opened by shibo950912 - 1
Default temperature
#146 opened by ssk705 - 5
预训练fim数据切割问题
#122 opened by boshi950912 - 3
继续预训练
#126 opened by boshi950912 - 2
请问官方是否有对Qwen2.5-Coder-7B-Instruct做过FIM相关数据集的评测?
#134 opened by kartikzheng - 1
技术报告里提到的Input-CoT和Output-CoT怎么理解?
#141 opened by chloefresh - 1
CodeQwen1.5-7B-Chat
#144 opened by ssk705 - 7
请问 Spider Text-to-SQL 数据集在 Qwen2.5-Coder 的训练集中吗?
#132 opened by ruilinWho - 3
- 2
7B-模型支持128K sequence length,但是config里面没有关于yarn的相关rope_type配置,还是用的是默认的 default?这个是为啥呀
#124 opened by Ericyfliu - 1
- 2
How to solve the error of model sft?
#128 opened by chenqi-205 - 0
Has it been tested on cross code eval?
#125 opened by mst272 - 0
通义千问coder团队,你好,
#121 opened by boshi950912 - 0
HumanEval Infilling benchmark
#120 opened by Cppowboy