salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
Jupyter Notebook · BSD-3-Clause
Issues
FileNotFoundError: Package has no location <module 'imageio_ffmpeg.binaries' (namespace)>
#709 opened by davidhouse2023 - 8
BLIP-2 paper finetuning replication gives low performance: BLEU_4 score is 0.15 for COCO caption finetuning
#707 opened by LuoyaoChen - 0
KeyError: 'image_id'
#708 opened by mshmoon - 1
A question about the BLIP-2 Q-Former
#706 opened by nbqu - 1
Generated output error
#704 opened by Mei0211 - 3
How should I use BLIP-2 for VQA task training?
#688 opened by WildLight - 0
How can I change cache_root?
#703 opened by minnsu03 - 0
cache_version ValueError
#702 opened by minnsu03 - 0
BLIP-2 captioning only generates "a photo of"
#697 opened by VincentWangty - 0
How is `Total Params` calculated?
#701 opened by Wuzimeng - 2
safetensors_rust.SafetensorError: Error while deserializing header: HeaderTooLarge
#693 opened by slrhs - 0
Custom dataset inference
#700 opened by aliman80 - 0
About Text Preprocessing of InstructBLIP
#698 opened by davidwang200099 - 3
Why do I always encounter a CUDA out-of-memory problem when I call load_model_and_preprocess? Can an RTX 3090 be used for the BLIP-2 model?
#670 opened by zhangmenghuan-mh - 1
Use BLIP-2 for Image Captioning
#692 opened by ArefAz - 0
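For reference, a minimal captioning sketch along the lines of the LAVIS README, assuming the `blip2_opt` model name with the `caption_coco_opt2.7b` checkpoint and a local `merlion.png` image (both placeholders):

```python
import torch
from PIL import Image
from lavis.models import load_model_and_preprocess

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

# Load a BLIP-2 captioning model plus its matching image preprocessor.
model, vis_processors, _ = load_model_and_preprocess(
    name="blip2_opt", model_type="caption_coco_opt2.7b", is_eval=True, device=device
)

# Preprocess one image and generate a caption.
raw_image = Image.open("merlion.png").convert("RGB")
image = vis_processors["eval"](raw_image).unsqueeze(0).to(device)
print(model.generate({"image": image}))
```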
salesforce-lavis 1.0.2 requires transformers<4.27,>=4.25.0, but you have transformers 4.40.0 which is incompatible.
#691 opened by CS123n - 3
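The pin in this error comes from salesforce-lavis's own requirements; a small sketch to check whether the installed transformers build falls in that range, downgrading with `pip install "transformers>=4.25.0,<4.27"` if it does not:

```python
# Check the installed transformers version against the range pinned by
# salesforce-lavis 1.0.2 (>=4.25.0,<4.27, per the error message above).
from packaging.version import Version
import transformers

v = Version(transformers.__version__)
ok = Version("4.25.0") <= v < Version("4.27")
print(f"transformers {v}: {'compatible' if ok else 'incompatible with salesforce-lavis 1.0.2'}")
```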
XInstructBLIP demo text generation
#689 opened by ParkJun-Yeong - 1
How to deal with “Missing keys”
#686 opened by jackbrown333 - 0
huggingface_hub.utils._validators.HFValidationError: Repo id must use alphanumeric chars or '-', '_', '.', '--' and '..' are forbidden, '-' and '.' cannot start or end the name, max length is 96: ''.
#687 opened by WildLight - 2
How to use it to output the target class
#685 opened by skydh - 0
How can I calculate the similarity between multimodal features and unimodal features?
#681 opened by dsdsknfsk - 0
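For context, the LAVIS feature-extraction interface exposes both unimodal and multimodal embeddings; a sketch in the style of the README, assuming the `blip2_feature_extractor` registry name and placeholder image/caption inputs:

```python
import torch
from PIL import Image
from lavis.models import load_model_and_preprocess

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
model, vis_processors, txt_processors = load_model_and_preprocess(
    name="blip2_feature_extractor", model_type="pretrain", is_eval=True, device=device
)

raw_image = Image.open("merlion.png").convert("RGB")
image = vis_processors["eval"](raw_image).unsqueeze(0).to(device)
text = txt_processors["eval"]("a photo of a fountain")
sample = {"image": image, "text_input": [text]}

# Unimodal features: projected image (query) embeddings and text embeddings.
feat_img = model.extract_features(sample, mode="image")
feat_txt = model.extract_features(sample, mode="text")

# Multimodal features: Q-Former output conditioned on both image and text.
feat_mm = model.extract_features(sample, mode="multimodal")

# Image-text similarity from the unimodal projections (max over the query tokens).
sim = (feat_img.image_embeds_proj @ feat_txt.text_embeds_proj[:, 0, :].t()).max()
print(sim.item(), feat_mm.multimodal_embeds.shape)
```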
Input of multiple images
#684 opened by dsdsknfsk - 0
OPT2.7B underperforming & weird behavior compared to flant5xl on image captioning?
#676 opened by Thomas2419 - 0
How to run InstructBLIP with another LLM
#680 opened by zhangzitenga - 0
Image used to present LAVIS
#679 opened by Raphaelle-Lemaire - 0
The role of modeling_opt.py in the BLIP2 model
#677 opened by abinzzz - 0
The results of evaluating the InstructBLIP model on DocVQA, InfoVQA, and OCR-VQA are very low
#671 opened by Fym68 - 0
Score difference between ITM and ITC?
#672 opened by Kapil-23 - 0
InstructBLIP outputs a long meaningless string
#666 opened by leedewdew - 0
Question about text localization
#667 opened by Yorkev - 0
Error occurred during BLIP2-demo execution
#663 opened by Kim-DKyu - 0
When using InstructBLIP to evaluate the OK-VQA dataset, there is nothing in the output path
#664 opened by Fym68 - 1
How does VQA work on BLIP-2 without an LLM?
#659 opened by jihwanp - 5
Why are checkpoints not saved?
#662 opened by SnowNation101 - 0
About the labels in the VQA task using OPT models
#661 opened by dszpr - 0
Segmentation Fault
#660 opened by dfan - 0
How to extract features from a custom finetuned BLIP-2 model on HuggingFace?
#657 opened by SnowNation101 - 0
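If the finetuned checkpoint lives on the HuggingFace side rather than in LAVIS, a rough sketch with the transformers Blip2Model API; the checkpoint path and image file below are placeholders:

```python
import torch
from PIL import Image
from transformers import Blip2Processor, Blip2Model

device = "cuda" if torch.cuda.is_available() else "cpu"

# Load the custom finetuned checkpoint (saved with save_pretrained).
processor = Blip2Processor.from_pretrained("path/to/finetuned-blip2")
model = Blip2Model.from_pretrained("path/to/finetuned-blip2").to(device)

image = Image.open("example.jpg").convert("RGB")
inputs = processor(images=image, return_tensors="pt").to(device)

with torch.no_grad():
    vision_out = model.get_image_features(**inputs)      # vision encoder features
    qformer_out = model.get_qformer_features(**inputs)   # Q-Former query features

print(vision_out.last_hidden_state.shape, qformer_out.last_hidden_state.shape)
```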
The relevant code for the Img2LLM work
#656 opened by wangfengjuan - 0