stanleylsx/llms_tool
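A training and testing tool for large language models, built on HuggingFace. It supports web UI and terminal inference for various models, parameter-efficient and full-parameter training (pre-training, SFT, RM, PPO, DPO), and model merging and quantization.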
Python · Apache-2.0
Issues
- 16
- 0
Error during SFT training
#76 opened by langgege-cqu - 1
AttributeError: 'DataManager' object has no attribute 'generating_args_preprocess'
#75 opened by autaugaville - 1
- 1
QLoRA apparently cannot be used together with DeepSpeed ZeRO-3?
#73 opened by shaomai00 - 0
DeepSpeed error
#72 opened by wq343580510 - 1
Could you try building a WebUI for training?
#69 opened by win10ogod - 2
About model pre-training
#67 opened by clclclaiggg - 0
The vocabulary expansion code needs optimization
#65 opened by tiandiweizun - 3
Eval loss is not output during SFT
#60 opened by shaomai00 - 0
DeepSpeed training error with baichuan2-13b-chat (DPO training)
#62 opened by MingJiaAn - 1
Could you set up a WeChat group or share contact information?
#61 opened by MingJiaAn - 1
Looking forward to the pre-training code
#59 opened by indiejoseph - 1
Error when fine-tuning a Qwen model with prefix-tuning
#47 opened by FelixZhang7 - 0
Pls support RWKV world model
#4 opened by xiaol - 1
About weight merging
#42 opened by FelixZhang7 - 4
Could you provide the script for generating the JSON? The file I generate raises an error; is it an encoding problem?
#39 opened by FelixZhang7 - 0
About weight merging
#41 opened by FelixZhang7