hiyouga/LLaMA-Factory

Unify Efficient Fine-Tuning of 100+ LLMs

PythonApache-2.0

Issues

SFT之后的OLMo模板跟OLMo meta template不一致，后续评测时需要修改
#3860 opened 16 days ago
2
Docker 构建报错 RuntimeError: can't start new thread
#3859 opened 18 days ago
1
pretain增量后的chat重复输入的内容
#3858 opened 16 days ago
1
加载checkpoint相关问题
#3857 opened 16 days ago
1
关于全参数微调loss的问题
#3856 opened 16 days ago
1
License 缺少copyright名字
#3855 opened 16 days ago
1
KTO和DPO显存占用对比
#3854 opened 16 days ago
3
FSDP + Qlora Faill
#3853 opened 16 days ago
1
使用KTO进行多机训练过程中再进行验证，报错RuntimeError: still have inglight params [{id：388, "status":"AVALIBLE"}]
#3852 opened 18 days ago
5
llama3-8B-base模型全量微调mmlu掉点
#3851 opened 18 days ago
7
dpo训练后导出模型
#3850 opened 19 days ago
1
RuntimeError: Internal: could not parse ModelProto from /mnt/data/legalexp/LLM_exp/MiniCPM/minicpm_finetune_baseline/MiniCPM-2B-sft-bf16/tokenizer.model
#3849 opened 19 days ago
1
Meta-Llama-3-8B-Instruct进行微调，合并权重后的模型正常。对模型进行int4量化后，量化模型推理出现问题
#3848 opened 10 days ago
5
KTO训练报错
#3847 opened 19 days ago
2
ceval评测结果都为0
#3846 opened 19 days ago
1
历史提交包含git lfs，导致clone后无法上传到自建git平台
#3845 opened 19 days ago
2
出现乱码该如何解决
#3844 opened 19 days ago
0
如何自定义损失函数
#3843 opened 19 days ago
1
4X4090 ，llama_pro examples, Error: Expected attn_mask dtype to be bool or to match query dtype, but got attn_mask.dtype: c10::Half and query.dtype: float instead.
#3842 opened 19 days ago
3
lora微调是否支持deepspeed
#3841 opened 19 days ago
5
Ascend NPU训练成功但是推理报错
#3840 opened 19 days ago
1
【ascend】 RuntimeError: [ERROR] HCCL error in: torch_npu/csrc/distributed/ProcessGroupHCCL.cpp:64
#3839 opened 16 days ago
1
部署自己模型的api
#3838 opened 19 days ago
2
eval运行mmlu时，results.json中的结果少了一项
#3837 opened 10 days ago
1
[rank2]: ImportError: Megatron is not installed. please build it from source.
#3836 opened 19 days ago
1
llama factory推理时，如何配置seed参数
#3834 opened 19 days ago
3
the number of output lines is more than the number of input lines when batch inference (qwen 1.5, single node, multiple GPUs)
#3833 opened 19 days ago
0
Yi-34B模型使用双卡deepspeed zero2 训练加载模型时占用CPU 内存>200G 不足导致失败
#3832 opened 19 days ago
1
关于llama2变种模型做可控生成，orpo后效果没有lora微调效果好的问题
#3831 opened 20 days ago
2
如何使用本地模型进行训练
#3830 opened 20 days ago
1
attn_implementation 不起作用
#3828 opened 20 days ago
1
利用 vLLM 部署 OpenAI API
#3827 opened 20 days ago
1
deepspeed不起作用
#3826 opened 20 days ago
1
显存大小与readme不符
#3825 opened 20 days ago
3
训练日志记录不完整
#3824 opened 10 days ago
3
VLLM部署api自动使用Ray集群，部署失败
#3823 opened 20 days ago
3
推理过程的自回归过程中，如果想要修改生成token对应的logits值，以方便自定义采样过程，如何实现
#3822 opened 20 days ago
0
adapter_name_or_path 继续训练sft的adapter
#3821 opened 20 days ago
2
predict时如何生成多条预测结果
#3820 opened 20 days ago
3
使用hhrlhf数据集时报错
#3818 opened 20 days ago
1
使用自定义数据预训练报错，应当如何排查问题
#3817 opened 20 days ago
3
cuda 内存溢出
#3816 opened 20 days ago
1
在Mac M系芯片的电脑上是否只支持FP32精度的微调啊？
#3815 opened 20 days ago
1
使用Baichuan2-7B-Chat批量推理结果出现乱码
#3814 opened 20 days ago
2
llama3-8b-base 微调后重复输出
#3813 opened 20 days ago
1
Confused about the llama-pro demo. Why `num_layers` 49 should be divisible by `num_layer_trainable` 2.
#3811 opened 21 days ago
2
昇腾多卡训练问题
#3810 opened 21 days ago
0
7B模型全量微调60GB的执行脚本如何编写？
#3809 opened 21 days ago
1
请问现在还有DPO微调的整个过程的示例吗？
#3808 opened 21 days ago
1
llava - RuntimeError: Index put requires the source and destination dtypes match, got Half for the destination and Float for the source.
#3807 opened 21 days ago
4