Issues
Details missing in the paper
#46 opened by Masaaki-75 - 0
is this still being worked on?
#28 opened by CoffeeShifter - 0
how to disable bf16 during inference?
#44 opened by nicolaus-huang - 12
Inference Problem
#42 opened by LKAMING97 - 1
Inference Problem
#43 opened by URRealHero - 5
Multimodal-in and multimodal-out
#18 opened by JoyBoy-Su - 0
Open-sourcing the training data
#41 opened by findalexli - 1
Question about the training data
#37 opened by Epiphqny - 2
Inference
#40 opened by jungao1106 - 5
The effect is very poor after training
#31 opened by coder4nlp - 0
Help with the transformers version
#39 opened by Raman1121 - 1
Question about Multimodal Input
#38 opened by ZixianGao - 1
Can Chinese caption data be used for finetune?
#36 opened by coder4nlp - 0
Should update Transformers version
#33 opened by xinlong-yang - 4
questions about the image generation?
#9 opened by mutonix - 1
where is vqgan decoder coming from
#32 opened by Yuheng-Li - 1
Support Hugging Face native support
#14 opened by JoyBoy-Su - 4
nan error running your example
#23 opened by MeNicefellow - 0
Some questions (maybe bugs) about training
#29 opened by Mr-Loevan - 1
Inf Loss Problem When Training
#30 opened by nreHieW - 1
Quantized Model
#17 opened by JoyBoy-Su - 6
question about the image understanding
#25 opened by df2046df - 3
Improve model performance
#15 opened by JoyBoy-Su - 4
How to prepare training data?
#20 opened by hxdtest - 4
Evaluation
#19 opened by JoyBoy-Su - 2
Initialize and pre-train a smaller model
#10 opened by win10ogod - 0
Transformers changes
#12 opened by winglian - 3
questions on the released model and training
#8 opened by ilovecv - 2
paper?
#2 opened by JoshonSmith - 2
bad results!
#5 opened by yyyouy - 2