Issues
When I use the phi-2 model, the output is “the the”
#11 opened by yuxiaoranyu - 2
Does this support fine-tuning/inference with Chinese data?
#31 opened by LianghuiGuo - 2
FileNotFoundError: [Errno 2] No such file or directory: 'prompts/alignment.txt'
#27 opened by yanghu819 - 0
llama_model: "/root/autodl-tmp/phi-new" — which version of phi is used in the stage 4 training process?
#33 opened by yzc-ippl - 0
Phi-2 problem
#26 opened by dexmac221 - 0
Could not create share link.
#25 opened by chygoa - 2
Size mismatch for stage1 checkpoints
#22 opened by wangskyone - 8
In stage 4 the Q-Former is trained, but according to the paper the Q-Former should not be trained
#23 opened by TangYuan96 - 2
Error while processing
#21 opened by DaBaiTuu - 2
Attempting to unscale FP16 gradients
#17 opened by sunzhe09 - 0
Could you share your environment image on the autodl community?
#19 opened by xiaoxue-roy - 1
REC Results
#16 opened by tydia - 1
performance evaluation details
#15 opened by dylanqyuan - 1
Typo in paper
#14 opened by hu-po - 2
What model should be used here?
#7 opened by VacantHusky - 5
Download from Hugging Face failed
#12 opened by DaBaiTuu - 1
In the first-stage training, LoRA is included.
#13 opened by zr-icu - 2
BLIP-2 / Q-Former / Benchmarks
#8 opened by Jotschi - 5
Thank you!
#6 opened by miolini - 2
'PhiAttention' object has no attribute 'q_layernorm'/ 'k_layernorm'/ 'k_postnorm'
#5 opened by zr-icu - 2
Unable to run on a remote jupyter environment
#4 opened by twilwa - 1
How long does it take to train?
#3 opened by zxti - 0
Nice Work!
#1 opened by xiaoachen98