dvlab-research/MGM
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
PythonApache-2.0
Issues
- 3
- 2
使用多gpu启动worker,对话时报错
#125 opened by kimi360 - 0
Request for Access to Google's Gemma models family for Medical Science Research
#139 opened by Geniusplug - 0
Update LLaMA-3 tokenizer strategy
#138 opened by ShaoTengLiu - 2
error in loading
#122 opened by TuuSiwei - 2
lora initialisation missing from builder.py
#106 opened by adrielkuek - 0
comfyUI 开始任务出现如下错误
#137 opened by jimmyyu1989 - 0
article error
#136 opened by TuuSiwei - 0
Unable to Merge LoRA Weights with Base Model: ValueError: Can't find 'adapter_config.json' at ...
#135 opened by PARSA-MHMDI - 2
TypeError: MGMllamaForCausalLM.forward() got an unexpected keyword argument 'cache_position'
#133 opened by czm708033 - 0
CHAIR Evaluation
#134 opened by itsqyh - 1
dataset miss problem
#116 opened by TuuSiwei - 16
- 1
- 1
when use stable-diffusion,AttributeError: 'NoneType' object has no attribute 'tokenize'
#131 opened by ALR-alr - 1
llama3 result is repeated many times
#130 opened by pennypengpm - 0
Does MGM support in-context(few-shot) inference?
#129 opened by waltonfuture - 0
Will there be support for Qwen2?
#128 opened by huxian0402 - 1
How to access hidden states?
#127 opened by Divyanshsingh1910 - 3
- 1
I get this error: WARNING: tokenization mismatch: 156 vs. 161. (ignored) when I finetune llama3
#126 opened by shidingz - 1
May I ask if the current inference code does not support multi images input
#114 opened by Angelalilyer - 3
loss 0 and grad nan
#123 opened by TuuSiwei - 2
- 1
| EORROR | stderr | RecursionError: Maximun recursion depth exceeded in comparison
#115 opened by linyf38 - 0
Do you meet the error "MGMConfig"?
#124 opened by strawberryrs620 - 0
- 0
Can provide laion-gpt4v dataset images zip?
#120 opened by TuuSiwei - 1
关于多机多卡效果不如单机多卡好的问题
#111 opened by DePengW - 1
Inference problem about the demo.
#118 opened by ApolloRay - 1
The data for alignment and finetuning contains duplicates. Can you please explain why this is happening?
#117 opened by KANGRuipeng - 4
可以放一下生成generation_pure_text数据的代码吗
#109 opened by pennypengpm - 1
Generation-related Instructions dataset link
#112 opened by berry-ding - 0
多轮对话修改图像输入后报错
#113 opened by pennypengpm - 0
Loss does not decrease
#110 opened by yfthu - 1
- 2
Which deepspeed version is it
#91 opened by Kareneveve - 0
LLama 70B support
#108 opened by PrateekPal641 - 0
Inference speed
#107 opened by PrateekPal641 - 2
Some weights of the model checkpoint were not used when initializing MGMLlamaForCausalLM
#103 opened by charlesCXK - 1
how to use stage2 ckpt fine-tuning stage3?
#102 opened by linqinguang - 1
Excessive Length of Responses from Mini Gemini
#97 opened by Dopplenum - 1
Use of ocr in Evaluation
#95 opened by bruceisme - 1
- 0
计划加入DPO训练来缓解模型幻觉问题吗
#101 opened by jiezhangGt - 2
- 2
Take input image as condition.
#100 opened by Adenialzz - 1
stage2 loss is 0
#98 opened by jiezhangGt - 1
- 2
当我使用推理命令的时候出现网络错误,无法构建推理的接口
#90 opened by HongLouyemeng