ymcui/Chinese-LLaMA-Alpaca-3
Chinese Llama-Alpaca LLM project, phase three (Chinese Llama-3 LLMs) developed from Meta Llama 3
Python · Apache-2.0
Issues
How to do incremental pre-training after expanding the vocabulary
#108 opened by mc112611 - 1
Running the exact same script multiple times with the same seed produces different outputs
#107 opened by dreamingshao - 2
Has the Chinese llama-3 undergone alignment training?
#103 opened by zzu-hzc - 2
Error when running the original inference script
#105 opened by dreamlychina - 1
v3 understands prompts worse than v2
#106 opened by osabc - 0
Are there any pre-trained models available to learn from?
#111 opened by ShikangPang - 3
Error loading the model ggml-model-q8_0.gguf
#88 opened by phoenixlucky - 4
Can further pre-training rely entirely on lora?
#92 opened by ymourenya - 3
Model performance on Chinese datasets
#87 opened by jerrywyn - 2
Question about continuing training on the lora version
#101 opened by gotimeqwer - 1
llama cpp has no GUI
#104 opened by LukeLIN-web - 2
How exactly is the Instruct-v3 model merge done, and what is the motivation for merging?
#98 opened by Play2Boy - 2
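Fine-tuning has already been run on one GPU; I now want to run a separate pre-training job, but the script fails. How should the script be modified?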
#95 opened by czhcc - 4
Why does the training code not use deepspeed this time?
#84 opened by ian08005454 - 4
How to train a Chinese long-context model on my own long-text training data, and what is the maximum supported length?
#93 opened by jy-101361-1810897 - 2
Data-construction code for fine-tuning omits the terminator from Output (repeated output)
#59 opened by fangzheng123 - 2
Multi-node multi-GPU training: both nodes stall after reaching this step [INFO|trainer.py:641] 2024-07-18 14:06:18,182 >> Using auto half precision backend
#91 opened by cc8476 - 2
Multi-GPU training fails with: terminate called after throwing an instance of 'c10::Error' what(): CUDA error: unspecified launch failure
#89 opened by cc8476 - 3
GPU memory grows when asking different questions, but not when repeating the same question
#83 opened by Chenhuaqi6 - 3
Can chinese-llama-2-13b-hf be further pre-trained directly in bf16?
#85 opened by NLP-Learning - 2
Is this data format correct for fine-tuning?
#86 opened by NiniAndy - 3
How to fine-tune with non-alpaca-format data such as pclue?
#80 opened by lotus0903 - 4
Merging Instruct-v1 and Instruct-v2
#74 opened by HuuHuy227 - 4
The llama3 tokenizer
#77 opened by ymourenya - 2
Dataset creation fails at the start of training
#82 opened by hk63560892 - 3
multi-node inference for llama3 70b
#79 opened by Abolfazl-kr - 1
Multi-GPU training fails with: terminate called after throwing an instance of 'c10::Error' what(): CUDA error: unspecified launch failure
#90 opened by cc8476 - 3
sft runs slowly, could someone please take a look
#75 opened by lingaoan2024 - 2
Error when reproducing this project's fine-tuning, details in the screenshot
#78 opened by hbs429469861 - 2
Newcomer question
#72 opened by lingaoan2024 - 7
Loss is 0 and grad_norm is NaN during fine-tuning
#64 opened by aa200647963 - 2
When will a web demo script be added for llama3?
#67 opened by dasaffa - 2
Model merging
#63 opened by xiaoxiaoto - 3
How is the Chinese arena platform implemented, and will the related code be open-sourced?
#56 opened by Infinity4B - 11
Fine-tuning in Colab fails: CUDA out of memory
#44 opened by chenmonster - 2
Inference anomaly when using inference_hf.py
#52 opened by Xiaoshu-Zhao - 2
How to set up multi-GPU training?
#51 opened by TDlemon-1900 - 3
The hfl/ruozhiba_gpt4 dataset has problems
#73 opened by wencan - 4
Merged model fails at inference
#69 opened by MonetCH - 1
Error when merging the lora model
#68 opened by MonetCH - 4
Training is interrupted abnormally
#47 opened by AnonymousDestroyer - 2
ruozhiba data: many answers are not high quality, and some questions contain traps that gpt4 failed to notice
#48 opened by AIchenkai - 4
Instruction fine-tuning fails on MacOS (Apple M3 chip)
#57 opened by yaoyonstudio - 1
checkpoint file error
#54 opened by jeffersyuan1976