Issues
- 0
- 3
- 0
- 3
alpha clip训练
#65 opened by dox012 - 1
image patch embedding
#66 opened by frederhoon - 1
if i want The training input data is four channels, because my six-channel data is spliced from six single-channel 2D images. How should I modify the code? I do not have a mask annotation file. Can I use this code?
#63 opened by watertianyi - 1
image default
#61 opened by Mxhjsz123 - 0
- 1
- 1
Guidance needed: Processing GRIT-20M dataset in .parquet format for Alpha-CLIP
#60 opened by qingpowuwu - 0
CLI environment demo of "alphaclip with LLM"
#58 opened by sunwoo76 - 0
- 2
- 2
Some of the code is not publicly available
#55 opened by eorroot - 1
Good work! How to get patch embedding of image?
#54 opened by cyysc1998 - 2
- 1
When do you release the training code?
#47 opened by oishikimchi97 - 7
- 1
UnpicklingError: invalid load key, '<'.
#44 opened by bolongliu - 2
- 1
- 1
Training Code Release
#48 opened by Zhiyuan-R - 1
Data release
#52 opened by CharlesGong12 - 2
Web Demo 502 Bad Gateway
#50 opened by CharlesGong12 - 3
- 0
- 3
- 2
The magic number of 1.9231 and 6
#28 opened by Wangt-CN - 2
Annotations of the generated Imagenet
#29 opened by callsys - 2
AhphaCLIP with llm Demo error
#39 opened by yjtlab - 5
Demo error
#38 opened by YasuoFly - 3
What data enhancements were used in AlphaCLIP?
#26 opened by DesertsP - 1
can you provided the mask of Imagenet ?
#33 opened by llf1234 - 1
Captions in GRIT
#41 opened by jiaosiyu1999 - 1
- 1
when will release alphaclip with ViT-H/14
#40 opened by akk-123 - 4
- 2
- 0
Poor performance on COCO dataset.
#36 opened by xuanpuZhao - 1
- 1
Do you consider trying Alpha-DINOv2?
#31 opened by grainseed - 2
- 2
Question: Can you provide some guidance for finetuning MLLM with alpha-clip vision encoder?
#24 opened by XuRui314 - 6
Encoding Images with Alpha Channel?
#23 opened by DavidGetter1 - 2
ViT-H/14 Model
#22 opened by zhangh0920 - 2
for one image,regardless of how the alpha channel is modified,feature similarity is consistently above 0.97 (even between mask=0 and mask=1)
#12 opened by llf1234 - 1
Alpha clip has reduced zero shooting ability compared to the original clip?
#13 opened by Originlightwkp - 11
- 1
- 2
The Alpha-clip demo with LLAVA will constantly repeat a sentence under certain specific images.
#20 opened by didiforgithub