RUCKBReasoning/codes

The source code of CodeS (SIGMOD 2024).

PythonApache-2.0

Issues

Inference on Custom Use Case w/o fine tuning
#23 opened 4 months ago by Mrs-Hudson
1
使用build_contents_index.py进行数据索引，程序结束后没有保存任何数据文件
#26 opened 4 months ago by lazyfuzzyguy
0
What are the hardware requirements for reproducing Codes15B?
#25 opened 5 months ago by kanseaveg
3
使用 seeklhy/codes-7b 模型进行sql generate 微调，8*A800 *80G 显存溢出问题
#21 opened 7 months ago by dshwei
2
few_shot和sft的参数选择
#24 opened 7 months ago by zzkzzkjsw
4
结果并不能达到理想的回复
#18 opened 7 months ago by gabrielpondc
1
您的实验中增量预训练选择starcoder基模型，如果选择code llama2-7b基模型，请问有做过实验对比吗？
#17 opened 7 months ago by dshwei
3
关于sql generate model 训练的细节
#15 opened 7 months ago by dshwei
1
schema linker 模型微调不能复现结果
#22 opened 7 months ago by dshwei
1
训练模型复现
#20 opened 7 months ago by dshwei
1
Regarding schema filtering
#19 opened 7 months ago by Hari-Dorbala
1
双向数据增强技术请教
#16 opened 7 months ago by mojianpo
1
codes-7b-bird 测试结果
#14 opened 9 months ago by dshwei
8
codes-7b-bird 模型在本地测试bird dev 结果复现
#13 opened 9 months ago by dshwei
1
when I tokenize three datasets , There are 14029574132 tokens in the pre train corpus ,toal about 14T
#11 opened 9 months ago by dshwei
3
Is it possible to upload basic models to Alibaba’s modelscope platform to facilitate downloading by domestic users?
#9 opened 9 months ago by CycloneBoy
3
The size of incremental pretraining datasets.
#7 opened 9 months ago by binz98
1
模型微调
#8 opened 9 months ago by shellhuang1227
8
I have found that using the word segmentation method provided in your code, the token_id appears beyond the length of the dictionary
#10 opened 9 months ago by dshwei
1
增量预训练的数据集中包含有starcoder 训练时使用的sql数据，这样重复使用slq数据进行训练，是否会出现遗忘其他问题
#12 opened 9 months ago by dshwei
1
Tokenize error
#6 opened 9 months ago by Kaimary
2
codes-7b-merged load error
#5 opened 10 months ago by ddingwang12
2
超出上下文限制的情况如何处理
#2 opened a year ago by steph730
3
数据集相关问题
#3 opened a year ago by steph730
6
Spider results
#4 opened a year ago by lwmlyy
1
请问这里使用simCSE的目的是什么呢
#1 opened a year ago by steph730
4