Pinned Repositories
DeCap
ICLR 2023 DeCap: Decoding CLIP Latents for Zero-shot Captioning
LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Qwen-VL-Lora-Model
可以成功Lora微调的Qwen-VL模型
mmrotate
OpenMMLab Rotated Object Detection Toolbox and Benchmark
Qwen-VL
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
LSKNet
(IJCV2024 & ICCV2023) LSKNet: A Foundation Lightweight Backbone for Remote Sensing
CrazyBrick's Repositories
CrazyBrick/DeCap
ICLR 2023 DeCap: Decoding CLIP Latents for Zero-shot Captioning