QwenLM/Qwen2.5-Coder
Qwen2.5-Coder is the code version of Qwen2.5, the large language model series developed by the Qwen team, Alibaba Cloud.
Python
Issues
License missing for this repo
#367 opened by vishalvvr - 0
Other language support
#365 opened by pdhoward - 0
What datasets are generally used when pre-training or fine-tuning this model? Could you share a dataset in the following format that can be used directly for SFT fine-tuning? Thank you!
#358 opened by ykk624 - 2
Does qwen2.5-coder support function calling?
#180 opened by Muuut - 0
Qwen2.5-Code-Instruct Performance without DPO
#357 opened by DarrenRuan - 2
Is the ExecRepoBench dataset file missing?
#346 opened by kartikzheng - 0
Question about instruction settings for ORPO fine-tuning
#356 opened by Sherww - 2
failed to install deepspeed==0.12.6+c00388a2 while finetuning using DPO method
#347 opened by salmankhh8 - 1
A consultation about code completion format: FIM (Fill In the Middle) + repository level information
#343 opened by mofanke - 1
Code completion does not work properly.
#342 opened by BearCooike - 1
How to train model with Lora?
#217 opened by Zongru-Wang - 4
Could you give an example of the data format (JSON) for fine-tuning (DPO)?
#224 opened by whk6688 - 2
Has Qwen2.5-Coder-Instruct been trained on bird-dev?
#223 opened by LJHzju - 2
torch.cuda.OutOfMemoryError: CUDA out of memory
#228 opened by whk6688 - 1
When will a multimodal code LLM be supported?
#218 opened by GuoAccount - 5
Error encountered while training qwen-2.5-3b model using Qwen2.5-Coder/finetuning/sft/train.py
#171 opened by Yhw109 - 7
Train a model for a new language
#200 opened by boyu9 - 3
A question about the Qwen2.5-coder model
#206 opened by jsuper - 1
Is there documentation on GPU memory usage for each version?
#207 opened by zhaojigang - 1
Tareesh
#208 opened by TAREESH8086 - 2
Some questions about CodeArena
#209 opened by buaali - 1
Qwen2.5-Coder-Instruct-C model release
#215 opened by DrozdikGleb - 8
About the CrossCodeEval test
#181 opened by yfzhou3993 - 5
Aider benchmark, DeepSeek-6.7B-Instruct model hardly generates SEARCH/REPLACE blocks, leading to very low pass rates
#192 opened by ytxmobile98 - 1
ICT in the San Martín region
#201 opened by heidi149 - 2
adding documents (docx, md, pdf) into qwen2.5-coder
#203 opened by ozgecinko - 3
[Bug]: Should the Qwen-Coder end token be <endoftext> or <im_end>?
#182 opened by m-maoyanyu - 4
Question about the training dataset
#190 opened by Hearum - 2
Error: cannot find tensor lm_head.weight
#173 opened by malikwirin - 1
torch.OutOfMemoryError: CUDA out of memory
#174 opened by old-kai - 2
the chat template `qwen2_5` corresponding to the model `qwen2_5-coder-32b-instruct-awq` is in chat format. Please use the `chat.completions` API.
#178 opened by gitYuZui - 4
tokenizer.json changed after ms-swift sft
#176 opened by oasis-0927 - 2
How can I use Qwen2.5-Coder 32B in Cursor?
#159 opened by IamTirion - 2
Error in evaluation on Qwen2.5-Coder
#172 opened by Zephyreeze - 10
the model continuously outputs repeated tokens
#163 opened by Yhw109 - 1
I clearly asked for a translation, but the model added inexplicable content on its own
#170 opened by pio57019 - 3
max_new_token max value for Qwen2.5-Coder Instruct?
#165 opened by Thireus - 1
Dataset & Reproducible Experiment
#167 opened by fblgit - 3
Hoping it keeps pace with o1 and reaches the level of strong logical reasoning.
#158 opened by qwas982 - 1
Is this a trained, ready-to-use model, or the code for training the model?
#164 opened by lionel-daydayup - 2
Cursor edit + Qwen2.5-Coder prompt?
#156 opened by Owen-Qin