Issues
Running llava-llama-3-8b-v1_1 fails: FileNotFoundError: can't find *_optim_states.pt files in directory
#674 opened by Egber1t - 2
past_length is None
#722 opened by vincent507cpu - 2
Fine-tuning stops here — what is the problem?
#718 opened by chalesguo - 2
question on training on multi gpus
#725 opened by ztfmars - 2
Error while fine-tuning llava_llama3_8b_instruct_qlora_clip_vit_large_p14_336_e1_gpu1_finetune_copy.py on my own data
#723 opened by J0eky - 0
Model is still downloaded even after specifying a local model path
#721 opened by vincent507cpu - 1
Execution exits unexpectedly: llava_llama3_8b_instruct_qlora_clip_vit_large_p14_336_e1_gpu1_finetune
#717 opened by chalesguo - 1
Running `NPROC_PER_NODE=2 xtuner train /root/StableDiffusionGPT/config/internlm2_1_8b_qlora_alpaca_e3_copy.py --work-dir /root/test/ft/train --deepspeed deepspeed_zero2` fails with an error
#719 opened by LTtt456c - 1
What is the purpose of dispatch?
#716 opened by shockjiang - 1
Is fine-tuning of multimodal LLMs currently supported, e.g. qwen-vl, internLM-VL?
#720 opened by ybshaw - 2
introduce cogvlm2
#714 opened by Jayantverma2 - 8
Does xtuner's model weight conversion support multiple GPUs?
#715 opened by AlittlePIE - 1
Is fine-tuning or training of the imagebind multimodal large model supported?
#711 opened by lewis-ing - 1
Could you provide an internlm2 1.8B llava model?
#712 opened by shockjiang - 16
Which parameters need to be set when training on Huawei Ascend NPUs?
#709 opened by apachemycat - 1
ERROR: Could not find a version that satisfies the requirement mpi4py-mpich (from xtuner[all]) (from versions: none)
#704 opened by eugine5 - 9
Roughly how much GPU memory is needed to fine-tune a 7B model at 32k sequence length?
#707 opened by hxujal - 10
Support the Yi-1.5 series Chat models
#698 opened by thomas-yanxin - 1
How to quantize llava-llama-3-8b-v1_1-hf with AWQ?
#699 opened by goodnight654 - 5
The following error occurs when merging llama3; the same problem also appears when using zero3
#686 opened by 1518630367 - 0
Plans to support InternLM-XComposer
#705 opened by babla9 - 0
How to mix in plain-text data when training LLaVA with xtuner
#701 opened by thomas-yanxin - 1
Question about the loss mask computation for multi-turn dialogue
#700 opened by RyanOvO - 2
Sample count decreases during data loading
#682 opened by Jason8Kang - 3
Sequence parallel is enabled even though I don't use it
#669 opened by amulil - 3
How much GPU memory does LoRA fine-tuning of chatglm3 require?
#694 opened by Franklin-L - 2
Help needed with a max_epochs issue
#677 opened by Franklin-L - 3
Meaning of `time` in the log output
#687 opened by shockjiang - 2
Is a custom vision encoder supported (llava-llama3)?
#668 opened by Yanllan - 2
Any support plan for llava-llama3-70b/12b? Any guide for module optimization?
#689 opened by ztfmars - 1
Cannot start training; mmengine seems to have a problem
#691 opened by Dominic23331 - 1
warning on CUTLASS&sparse_attn&triton
#676 opened by shockjiang - 1
Official script with its corresponding dataset reports mismatched column names; a custom dataset in the same format also raises an error
#692 opened by LumenScope - 2
shape mismatch when loading llava-phi path
#681 opened by shockjiang - 1
Error: FileNotFoundError: [Errno 2] No such file or directory: '/app/work_dirs/chatglm2_6b_qlora_lawyer_e3_copy/20240514_035914/vis_data/eval_outputs_iter_499.txt'
#685 opened by rcejzibjks38 - 2
How to pre-train llama3 at 128k sequence length on 8×A100?
#683 opened by 1518630367 - 0
safetensors file weights
#680 opened by WEXIJUE - 0
support flash-attn in Phi3
#679 opened by shockjiang - 5
LLaVAModel finetuning with llava-phi-3-mini-xtuner
#671 opened by shockjiang - 0
xtuner mmbench on xtuner/llava-llama-3-8b-v1_1-hf
#673 opened by Yanllan