jianzhnie/LLamaTuner
Easy and Efficient Finetuning LLMs. (Supported LLama, LLama2, LLama3, Qwen, Baichuan, GLM , Falcon) 大模型高效量化训练+部署.
PythonApache-2.0
Issues
- 0
该项目与qlora的差别
#100 opened by lixiaoxiangzhi - 0
- 0
不同样式的样本对应什么样的情形,如何根据自己的需求选择样本的样式
#101 opened by lixiaoxiangzhi - 0
这么好的项目怎么没有issues,我先赞一个
#3 opened by nieallen - 2
QLORA微调alpaca_data.json报错 'padding_value' (position 3) must be float, not NoneType
#96 opened by kanxueli - 1
总是这个错误怎么解决
#93 opened by liumaohui - 1
llama-2-13b的模型用单卡跑lora就会报错
#95 opened by Enoch202 - 0
批量推理时结果异常
#98 opened by ZayIsAllYouNeed - 1
微调后的Llama-2-7b,在模型加载时出错
#97 opened by kanxueli - 6
faile on 3090
#4 opened by SeekPoint - 2
[问题]有关训练可视化
#92 opened by RickMeow - 2
About llama-2-70B fine-tuning
#91 opened by RickMeow - 2
zero3保存的模型无法加载
#90 opened by charliedream1 - 1
如何使用自己的数据集
#89 opened by clannadcl - 6
- 3
下载了百川7b模型后,直接在gradio_webserver.py里推理,生成内容乱码问题
#78 opened by FDwangchao - 1
llama2-13B和llama2-70b微调所需要的显卡配置
#87 opened by batindfa - 4
训练成功了,但是没有合并的脚本,请问如何合并?
#7 opened by apachemycat - 3
ValueError: Undefined dataset tatsu-lab/alpaca
#81 opened by SeekPoint - 4
请问支持Baichuan 13B吗?
#56 opened by mynewstart - 3
单机多卡并行训练报错
#69 opened by wgzhendong - 1
Baichuan7B使用lora微调后测试时总会再次输出query
#73 opened by GaoXinJian-USTC - 13
多卡似乎不能将每张卡跑满,请问如何才能让每张卡的计算负载跑满呢
#66 opened by RayneSun - 1
Would you support RWKV?
#57 opened by xiaol - 1
中文文档里没写对baichuan-13B的支持,但英文写了
#62 opened by RayneSun - 4
- 1
多卡加速支持evalution吗
#61 opened by greenriver777 - 1
data_utils.py 是不是有问题?
#59 opened by wgzhendong - 1
32g内存+3060ti6G显存可以finetune 7B的模型吗?
#54 opened by TodayWei - 1
baichuan-7B: AttributeError: 'CastOutputToFloat' object has no attribute 'weight'
#39 opened by franciszhang92 - 1
RuntimeError: probability tensor contains either `inf`, `nan` or element < 0
#6 opened by weifan-zhao - 4
百川7B 模型微调结果
#30 opened by jianzhnie - 3
- 4
How to adapt QLoRA to other base models?
#9 opened by RanchiZhao - 3
4bit loaded error
#2 opened by yuxuan2015 - 0
合并的Bug修复,等会就出来验证结果了
#14 opened by apachemycat - 1
合并代码还有一个错误,中间少了一个变量
#12 opened by apachemycat - 1
8bit和4bit训练效果对比有吗
#8 opened by kongjiellx - 1
utils/apply_lora.py 有个小Bug
#10 opened by apachemycat - 4
RuntimeError: probability tensor contains either `inf`, `nan` or element < 0
#5 opened by weifan-zhao - 1
What's the real LICENSING
#1 opened by jiacheo