dvlab-research/LISA
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
PythonApache-2.0
Issues
- 0
Is LISA-7B-v1 finetuned on ReasonSeg?
#175 opened by DanielSHKao - 3
About training environment!!!
#151 opened by jifeng35 - 1
How to Resume
#165 opened by HIT-liar - 2
Can i use 4 24 G GPU for this project fine tuning
#147 opened by aliman80 - 1
- 1
Inference produces blank segmentation masks
#174 opened by vanshikavats9 - 1
How to train my datesets?
#140 opened by Amark-cheey - 5
can't load LISA++
#173 opened by victoriazinkovich - 11
Can't reproduce the results of lisa-7b
#135 opened by L1uShuai - 0
I'd like to ask about methods to force the model to generate the <SEG> token.
#170 opened by minjung98 - 3
- 2
Mask image seems to be nothing at all
#160 opened by JesseZZZZZ - 2
Why cat 255 tokens to seg_token_mask?
#172 opened by zhang-yige - 0
> > LLaVA-1.5 uses 336px image resolution, so you should change the clip model and control max context length. Also, the image token length is set to 256 by default, but when the resolution is changed to 336, the image token length should be set to 576. Overall, some implementation details need further consideration to adapt to llava-1.5. You should check that in detail.
#163 opened by Amark-cheey - 0
关于结合LLM与SAM进行特定物体分割的咨询
#171 opened by Linn0910 - 0
Some questions about finetuning and LISA++
#169 opened by chris5zk - 0
Model with Mask2Former
#168 opened by stdrr - 1
step vs epoch
#141 opened by dohyun1411 - 0
can it accept video input?
#167 opened by ljh-7 - 9
Paper provided example can not be reproduced !!
#153 opened by HuangChiEn - 0
Stuck at Loading checkpoint shards 100%
#166 opened by Xamexer - 0
How to Resume
#164 opened by HIT-liar - 1
Training on Pretrained LISA Model
#159 opened by Mactor2018 - 0
Result Reproduction Problem on Refcoco Dataset
#162 opened by fazhdo - 0
请问会考虑开源lisa++的权重和数据集吗
#161 opened by leftdoor - 0
About Dataset reason_seg/explanatory
#158 opened by asad14053 - 0
- 0
Request for the latest code of LLaVA 1.5
#156 opened by Bilibilee - 0
RuntimeError: CUDA error: unknown error
#154 opened by Shaosifan - 2
How many samples were iterated in total?
#131 opened by Syn000 - 1
请问python版本是什么
#152 opened by Issac304 - 0
Supplementary material
#150 opened by mtchiu - 0
Curious about the multi-turn conversation demo
#149 opened by chris5zk - 0
Is the demo link wrong ?
#148 opened by HuangChiEn - 0
[seg] token inference
#145 opened by xiexing0916 - 0
Supplementary material of the accepted paper
#144 opened by yo3nglau - 0
How to evaluate on RefCOCO datasets ?
#143 opened by Z-MU-Z - 1
Token indices sequence length is longer than the specified maximum sequence length for this model (565 > 512). Running this sequence through the model will result in indexing errors
#128 opened by Roberyan - 0
IndexError: The shape of the mask [6, 440] at index 1 does not match the shape of the indexed tensor [6, 760, 256] at index 1
#142 opened by Oseemaker - 2
flash_attn
#137 opened by nightbreezecool - 0
Inference speed on refcocog dataset
#126 opened by Yting68 - 2
Is LISA model on Hugginface?
#127 opened by gauravsh0812 - 0
Finetuning Projection Layer or Not
#139 opened by ziqipang - 0
LISA-7B-v1&& LISA-7B-v1-explanatory
#138 opened by nightbreezecool - 0
datasets download links
#136 opened by nightbreezecool - 0
Incorect online demo link
#134 opened by dohyun1411 - 2
LISA model_forward bug, only occurs when inference.
#132 opened by Roberyan - 0
Code error about scheduler in train_ds.py
#133 opened by eeeric-code - 0
Why apply the causal mask to image tokens in the attention operations of LLM?
#130 opened by yubin1219 - 0
How to generate ReasonSeg Dataset
#129 opened by ZhixiongSun