deepseek-ai/DeepSeek-Coder

DeepSeek Coder: Let the Code Write Itself

PythonMIT

Issues

Finetune of FIM
#123 opened 3 months ago by shatealaboxiaowang
4
How to use fine-tuned model?
#157 opened a month ago by aldialimucaj
1
本地部署怎么实现vscode自动代码补全？
#156 opened a month ago by lingyezhixing
1
markdown格式的数据预训练
#154 opened a month ago by huangqingyi-code
3
使用vllm加载33b-base或33b-instruct后，使用DS-1000、Program-Aided Math Reasoning (PAL)评估集进行评估，得分很低，与论文上的数据不符
#159 opened a month ago by aigc001
0
使用vllm加速inference后输出容易不符合格式要求
#158 opened a month ago by zhengrongz
0
微调完的模型，如何跟基础模型合并？
#155 opened a month ago by libingbingd
1
Does DeepSeek-Coder have wasm related knowledge?
#150 opened 2 months ago by XinyuShe
1
clarification on the sentinel token format
#147 opened 2 months ago by Zane-XY
0
官方提供的微调训练脚本是否支持33B模型训练？(及训练相关问题)
#140 opened 2 months ago by tongyuhome
1
Please pass your input's `attention_mask` to obtain reliable results.
#112 opened 2 months ago by metero20000
1
Trying to finetune DeepSeek-Coder on custom Dataset
#137 opened 2 months ago by A-Janj
13
Why generate "GGGGG...." ,when the input string is longer than a certain length in GGUF model?
#151 opened 2 months ago by hzgdeerHo
1
请问支持function call吗？支持在RAG中实现inline citations吗？
#153 opened 2 months ago by hiber-niu
0
How can I do continue pretraining?
#145 opened 2 months ago by hwaking
1
ERROR: ImportError: cannot import name 'SyncManager' from partially initialized module 'multiprocessing.managers' (most likely due to a circular import)
#114 opened 4 months ago by kokolerk
3
Are NTP and FIM 2 separate stages of training, or are they combined?
#146 opened 2 months ago by Calvinnncy97
4
What is the base context length of the model before extension to 16k?
#152 opened 2 months ago by Calvinnncy97
1
微调后用代码中的evaluation做humaneval评测时报错Failed to extract code block with error `list index out of range`:
#111 opened 4 months ago by mst272
13
使用react调用接口错误
#148 opened 2 months ago by trookie2000
0
tokenizer.json issue creating gguf files
#124 opened 3 months ago by RonanKMcGovern
2
Catastrophic forgetting problem
#134 opened 3 months ago by shatealaboxiaowang
2
预训练细节（fim）
#113 opened 4 months ago by lightdf
3
Fail to fine-tune V1.5 model with custom llama script
#144 opened 2 months ago by lijierui
1
33B inference too slowly
#142 opened 2 months ago by ZJXNEFU
1
Pretraining code
#132 opened 2 months ago by Calvinnncy97
2
Leetcode数据集的构建脚本请问可以开源吗
#141 opened 2 months ago by jzzzf
0
如何构建微调的CoT数据
#139 opened 3 months ago by wangqn1
1
33B AWQ量化+vLLM部署问题
#138 opened 3 months ago by CarolXh
0
chat completion任务时输出大量<|EOT|> token
#136 opened 3 months ago by CarolXh
3
How is the amount of training data measured?
#128 opened 3 months ago by WentaoChen0813
1
deepseek-coder-7b-base-v1.5 tokenizer=LlamaTokenizerFast 为什么分词会有很多乱码字符呢?
#129 opened 3 months ago by zheng5yu9
1
Code to generate data
#131 opened 3 months ago by tbressers
1
模型推理完成后怎么一直占用显存呢？
#133 opened 3 months ago by chris-rong
1
Repository Level Code Completion format question
#116 opened 4 months ago by zch-cc
2
Undefined variable in `Evaluation/MBPP/human_eval/evaluation.py`
#126 opened 3 months ago by ya0guang
0
Reproduce FIM Evaluation
#130 opened 3 months ago by Hambaobao
1
请问一下最新发布的7b-v1.5模型不支持中间补全吗
#110 opened 4 months ago by Reve1ations
9
Detailed version information of test programs in different languages.
#127 opened 3 months ago by Hambaobao
0
Question about training dataset
#125 opened 3 months ago by TJ1999
0
Clarification Request on Discrepancies Between Appendix B and Section 4.1 Results
#119 opened 3 months ago by s-JoL
4
Construction of the FIM training data
#107 opened 4 months ago by shatealaboxiaowang
3
How many tokens of code in pretraining
#121 opened 3 months ago by bigeagle
2
Swift and Objective C?
#122 opened 3 months ago by rlaferla
1
eos_token_id for v1.5 model
#118 opened 3 months ago by G07cha
4
TensorRT Quantization Breaks for `LlamaLinearScalingRotaryEmbedding`
#117 opened 4 months ago by Sanger2000
0
Regex of HASDEPENDENCY in Dependency Parsing
#115 opened 4 months ago by alex8937
1
Training loss extremely noisy during fine-tuning and randomly goes to 0
#106 opened 4 months ago by zpx01
1
Possible generation bug?
#108 opened 4 months ago by kyesniper
2
HF chat-ui Prompt Template (DeepSeek Coder 6.7B)
#104 opened 4 months ago by GANJAC
0