baichuan-inc/Baichuan2
A series of large language models developed by Baichuan Intelligent Technology
Python · Apache-2.0
Issues
- 6
Baichuan2 Chat Template
#392 opened by YanshekWoo - 2
baichuan2-13B-chat fine-tuning loss stays at 0
#401 opened by sunjinguo - 1
Question about token length during fine-tuning
#384 opened by jichaoqun - 0
Data format differs between training and inference
#408 opened by whl2333 - 2
Base model inference: pred is identical to inputs
#394 opened by shuiyigt - 0
- 0
I ran 4 epochs of LoRA fine-tuning but the model has not converged; how do I resume training from a saved checkpoint?
#406 opened by XuKun2424 - 1
Baichuan2-7B-Base fine-tuning error: AttributeError: 'BaichuanConfig' object has no attribute 'z_loss_weight'
#395 opened by qingchen177 - 0
Dataset
#405 opened by Coir-hat-man - 0
Does the Baichuan2-13B-Chat-4bits model support Mac?
#404 opened by IguoChan - 0
Fine-tuned baichuan2-13b: vllm output is inconsistent with the official web_demo results
#403 opened by kingduxia - 2
Baichuan2-13B-Chat-4bits fails to run
#400 opened by you567 - 0
baichuan2-7B-chat fine-tuning raises an error when using TrainerCallback
#402 opened by JackMeiLong - 1
Integrating baichuan2 with the fastgpt framework requires a streaming interface; requesting support
#398 opened by hhtao - 0
- 3
- 1
Using fastgpt requires a streaming interface; requesting support
#397 opened by hhtao - 0
LLM gives different outputs across runs for the same input
#396 opened by N-Kingsley - 3
Baichuan2-7B-Chat-4bits fails to run on CPU under Windows
#377 opened by qingchen177 - 2
13B-chat fine-tuning: each training step takes a very long time
#362 opened by KevinFan0 - 0
Do the Baichuan2 7B and 13B models use the same training data and the same data ordering?
#393 opened by txy77 - 0
Is there a way to extend the input window to 8k?
#391 opened by twwch - 1
Training Baichuan2-7B-Chat on a custom dataset using LLAMA: answers are incoherent; what exactly is the problem?
#357 opened by TzyTman - 1
CPU at 100% when calling the API
#388 opened by liuzongyang255 - 0
Are there plans to open-source a model of around 1B?
#389 opened by Yuhuajoe - 1
GPU memory usage doubles after training with Baichuan2-7B-Base
#387 opened by Mr-KenLee - 0
API file-management upload endpoint returns error code 500
#386 opened by kunkun8866 - 0
How to convert the Baichuan2-13B-chat model to the baichuan1 format
#385 opened by guoqiangqi - 0
- 1
- 2
Wrong storage type for quant_state in the baichuan2-13b-4bits offline model
#367 opened by sxndqc - 0
Training cannot proceed with the Baichuan2-13b-chat-v2 version
#381 opened by Taskii-Lei - 0
Changing the model's self-introduction text
#379 opened by wuQi-666 - 0
streamlit run ./web_demo.py runs, but accessing it in the browser fails with: WebSocket connection to 'ws://xx.xx.xx.xx:xxx/_stcore/stream' failed:
#378 opened by DaihuaWei - 4
baichuan-7B-chat-4bits fails to run
#366 opened by Yining0907 - 4
Differences between baichuan2 v2-8k and v1-4K
#368 opened by DSXiangLi - 0
Continued training fails
#375 opened by lichenyigit - 0
Baichuan 2 (7B) has no native support for Ascend NPU inference
#374 opened by AnitaSherry - 0
How to deploy NPU inference on a Huawei Ascend AI server
#373 opened by water-2022 - 3
Garbage: nothing but empty boasting, and not even a single person maintaining it
#371 opened by qianma819 - 2
- 0
Inference error with baichuan2-13b v1.0
#372 opened by yuege613 - 1
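Without submitting the previous conversation, asking the same sentence several times can give different results; what is the cause? Is my configuration wrong?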
#365 opened by zbsean - 0
Fine-tuning runs out of memory on a single machine with five 3090 cards, but works on a single machine with one A6000
#370 opened by fushun1990 - 1
How many languages was your baichuan2 model trained on?
#364 opened by fxb392 - 1
How to improve the model's knowledge retention when fine-tuning 13B
#363 opened by K-Alex13 - 1
What Python environment is required to run it? It reports an xFormers version problem
#359 opened by qianma819 - 0
baichuan2-13B-chat: generation is slow, the output is garbled, and the program crashes after a dozen or so characters; looking for the cause
#361 opened by blueskyban - 0
Merged model raises an error during chat: generation_utils.py unsupported operand type(s) for -: 'int' and 'NoneType'
#360 opened by growmuye - 2
Baichuan 2 supports Ascend NPU inference
#358 opened by ssm0808