OFA-Sys/gsm8k-ScRel
Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models
Python
Issues
About the data augmentation code for MuggleMath
#24 opened by hlcle - 1
About the collect_rejection_sampling.py
#23 opened by jzh9830 - 8
Do you have any plan to release the data?
#18 opened by ngc7292 - 1
The RFT data
#19 opened by ZIKEYUAN - 6
70B training fails
#16 opened by kumar-shridhar - 1
Inference with OFA-Sys/gsm8k-rft-llama13b2-u13b has a shape error: a matrix shape error occurs during inference with the u13b version of 13B llama2
#14 opened by AegeanYan - 12
Error when loading the authors' open-sourced OFA-Sys/gsm8k-rft-llama7b-u13b
#8 opened by Haskely - 3
Environment
#13 opened by nuochenpku - 4
When will the 33b RFT model be released?
#11 opened by nuochenpku - 2
Reproducing llama7b2-sft problem
#12 opened by huijiawu0 - 14
Release the RFT 7B model
#2 opened by wenhuchen - 3
When will the LLama13b RFT model be released?
#10 opened by xingweiqu - 23
Some questions about details of the source code
#7 opened by Haskely - 1
Questions about RFT Inference
#6 opened by waterhorse1 - 3
Release RFT datasets
#4 opened by nuochenpku - 2
Missing test.py file
#3 opened by huijiawu0