mbzuai-oryx/groundingLMM
[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.
Python
Issues
- 0
Issues in Shape Handling and Original Image Size
#79 opened by gongshichina - 0
can't train with the dataset(--use_reg_data --reg_dataset 'VisGen_Reg' --val_dataset 'VisGenomeRegVal')
#78 opened by zhenze2 - 0
can't eval
#77 opened by zhenze2 - 1
Online Demo is down again
#73 opened by HAL-42 - 0
app.py error
#76 opened by hanzefang - 1
- 2
Region input for demo
#75 opened by 582383982 - 1
Offline demo error
#66 opened by Roberyan - 1
Potential Data Leakage?
#74 opened by joshmyersdean - 5
- 1
Online Demo Down
#51 opened by Shengcao-Cao - 0
Understanding predicting multiple masks
#72 opened by joshmyersdean - 2
can not install mmcv
#55 opened by zhipeixu - 1
OOM Error on 8xA100
#71 opened by joshmyersdean - 1
How can I finetune on combined tasks?
#62 opened by Rubics-Xuan - 2
The training losses in the GCG task
#33 opened by gWeiXP - 0
- 1
Why is it that during the computation of segmentation results, the model() function is used instead of model.generate()? Wouldn't this mean that when predicting the next token, the information viewed is from the actual token rather than the predicted one?
#67 opened by L1uShuai - 0
How to Construct a Ground-Truth Test Dataset
#69 opened by hungnh1125 - 0
What are the ‘categories’ in the dataset used for? When would I use them?
#68 opened by bibibabibo26 - 0
AssertionError when running a demo
#65 opened by suikei-wang - 1
Confusing referring segmentation results.
#63 opened by ZhimaoPeng - 1
mmcv failed to install
#64 opened by suikei-wang - 1
- 0
How should I train on the GranD dataset
#61 opened by Fangyi-Chen - 1
training V-L and L-P projection layer
#58 opened by remvanthull - 2
Training on New Data
#57 opened by ajb8866 - 2
About region caption
#48 opened by mu-jin-meng - 2
3D implementation of GLaMM
#46 opened by remvanthull - 3
- 2
token_positives
#53 opened by Pumpkin123709 - 1
assertion error cur_len == total_len
#54 opened by bibibabibo26 - 0
- 3
Question about Output Quality Difference Between Local and Online Demo for MBZUAI/GLaMM-FullScope
#39 opened by Jayce1kk - 4
- 0
Some bugs in the GranD_ReferringSegm_ds.py
#50 opened by xandery-geek - 0
may i ask your total parameter?
#49 opened by CoderZhangYx - 1
local llm interface for glamm
#44 opened by whpy - 1
- 1
Fluctuate results on RefCOCO Family when evaluating the referring expression segmentation.
#42 opened by Glupayy - 5
An error is reported when running eval
#41 opened by clevercaicai - 7
GrandD Detailed Operation Guide
#35 opened by hzdzkjdxyjs - 9
Release of pre-training instructions?
#31 opened by remvanthull - 2
Grand-env
#36 opened by hzdzkjdxyjs - 1
A bug in region captioning evaluation scripts
#40 opened by machuofan - 2
About GranD Pre-training Dataset
#37 opened by Shengcao-Cao - 1
the demo caption is very simple
#38 opened by trouble-maker007 - 2
- 1
- 0
Phrase grounding model
#29 opened by ekazakos