zq-zang's Stars
KennithLi/Awesome-Zero-Shot-Object-Detection
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
xing61/zzz-api
优质稳定的OpenAI的API接口-For企业和开发者。OpenAI的api proxy,支持ChatGPT的API调用,支持openai的API接口,支持:gpt-4,gpt-3.5。不需要openai Key, 不需要买openai的账号,不需要美元的银行卡,通通不用的,直接调用就行,稳定好用!!智增增
QwenLM/Qwen-VL
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
ChenDelong1999/RemoteCLIP
🛰️ Official repository of paper "RemoteCLIP: A Vision Language Foundation Model for Remote Sensing" (IEEE TGRS)
djiajunustc/TransVG
facebookresearch/Detic
Code release for "Detecting Twenty-thousand Classes using Image-level Supervision".
xiaoyuan1996/SemanticLocalizationMetrics
The first research for semantic localization
alirezazareian/ovr-cnn
A new framework for open-vocabulary object detection, based on maskrcnn-benchmark
PaddlePaddle/PaddleDetection
Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.
ysy31415/unipaint
Code Implementation of "Uni-paint: A Unified Framework for Multimodal Image Inpainting with Pretrained Diffusion Model"
DPS2022/diffusion-posterior-sampling
Official pytorch repository for "Diffusion Posterior Sampling for General Noisy Inverse Problems"
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
Baijiong-Lin/LoRA-Torch
PyTorch Reimplementation of LoRA (featuring with supporting nn.MultiheadAttention)
om-ai-lab/RS5M
RS5M: a large-scale vision language dataset for remote sensing [TGRS]
extreme-assistant/Deep-learning-datasets
整理分类深度学习各方向公开数据集
shikras/d-cube
A detection/segmentation dataset with labels characterized by intricate and flexible expressions. "Described Object Detection: Liberating Object Detection with Flexible Expressions" (NeurIPS 2023).
mlvlab/RALF
Official implementation of CVPR 2024 paper "Retrieval-Augmented Open-Vocabulary Object Detection".
longzw1997/Open-GroundingDino
This is the third party implementation of the paper Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection.
MasterBin-IIAU/UNINEXT
[CVPR'23] Universal Instance Perception as Object Discovery and Retrieval
lichengunc/refer
Referring Expression Datasets API
CircleRadon/Osprey
[CVPR2024] The code for "Osprey: Pixel Understanding with Visual Instruction Tuning"
shikras/shikra
OFA-Sys/OFA
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
OpenGVLab/VisionLLM
VisionLLM Series
Vision-CAIR/MiniGPT-4
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
SkyworkAI/Vitron
NeurIPS 2024 Paper: A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, Editing
microsoft/LoRA
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.