open-vocabulary-detection

There are 26 repositories under open-vocabulary-detection topic.

IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
Language:Jupyter Notebook15.4k 116 3931.4k
roboflow/notebooks
Examples and tutorials on using SOTA computer vision models and techniques. Learn everything from old-school ResNet, through YOLO and object-detection transformers like DETR, to the latest models like Grounding DINO and SAM.
Language:Jupyter Notebook5.7k 79 144904
roboflow/awesome-openai-vision-api-experiments
Must-have resource for anyone who wants to experiment with and build on the OpenAI vision API 🔥
Language:Python1.7k 26 5133
FoundationVision/GLEE
[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale
Language:Python1.1k 48 4886
IDEA-Research/Grounding-DINO-1.5-API
Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series
Language:Python826 14 4727
SkalskiP/awesome-foundation-and-multimodal-models
👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]
Language:Python590 26 444
segments-ai/panoptic-segment-anything
Combining Segment Anything (SAM) with Grounded DINO for zero-shot object detection and CLIPSeg for zero-shot segmentation
Language:Jupyter Notebook388 8 525
wanghao9610/OV-DINO
Official implementation of OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion
Language:Python270 8 5315
Charles-Xie/awesome-described-object-detection
A curated list of papers and resources related to Described Object Detection, Open-Vocabulary/Open-World Object Detection and Referring Expression Comprehension. Updated frequently and pull requests welcomed.
225 9 017
FoundationVision/GenerateU
[CVPR2024] Generative Region-Language Pretraining for Open-Ended Object Detection
Language:Python150 7 156
CVMI-Lab/CoDet
(NeurIPS2023) CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection
Language:Python115 6 218
naver/shine
[CVPR'24 Highlight] SHiNe: Semantic Hierarchy Nexus for Open-vocabulary Object Detection
Language:Python110 6 88
shikras/d-cube
A detection/segmentation dataset with labels characterized by intricate and flexible expressions. "Described Object Detection: Liberating Object Detection with Flexible Expressions" (NeurIPS 2023).
Language:Python108 8 157
rohit901/cooperative-foundational-models
[WACV 2025] Official code for our paper "Enhancing Novel Object Detection via Cooperative Foundational Models"
Language:Python60 7 74
lorebianchi98/FG-OVD
[CVPR2024 Highlight] Official repository of the paper "The devil is in the fine-grained details: Evaluating open-vocabulary object detectors for fine-grained understanding."
Language:Python47 5 103
ibaiGorordo/ONNX-YOLO-World-Open-Vocabulary-Object-Detection
Python scripts performing Open Vocabulary Object Detection using the YOLO-World model in ONNX.
Language:Python46 2 16
hpc203/GroundingDINO-onnxrun
使用onnxruntime部署GroundingDINO开放世界目标检测，包含C++和Python两个版本的程序
Language:Python44 3 36
om-ai-lab/OVDEval
A Comprehensive Evaluation Benchmark for Open-Vocabulary Detection (AAAI 2024)
Language:Python40 5 32
jaychempan/LAE-DINO
🦕 [AAAI'25] Official Code for “Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community"
34 5 11
wusize/CLIM
[AAAI2024] Code Release of CLIM: Contrastive Language-Image Mosaic for Region Representation
Language:Python24 1 23
mala-lab/SIC-CADS
Code Implementation of "Simple Image-level Classification Improves Open-vocabulary Object Detection" (AAAI'24)
Language:Python22 1 93
lorebianchi98/FG-CLIP
[CBMI2024 Best Paper] Official repository of the paper "Is CLIP the main roadblock for fine-grained open-world perception?".
Language:Jupyter Notebook21 4 20
lartpang/OVCamo
(ECCV 2024) Open-Vocabulary Camouflaged Object Segmentation
Language:Python20 3 21
CUHK-AIM-Group/CLIFF
[ECCV' 24] CLIFF: Continual Latent Diffusion for Open-Vocabulary Object Detection
Language:Python18 1 30
hpc203/Open-Vocabulary-Object-Detection-opencv-onnxrun
使用OpenCV+onnxruntime部署开放域目标检测，包含C++和Python两个版本的程序
Language:C++11 1 10
EasyWalk-PRIN/OpenNav
Official code for the OpenNav: Efficient Open Vocabulary 3D Object Detection for Smart Wheelchair Navigation - ACVR Workshop at ECCV'24
Language:Python4 2 11

open-vocabulary-detection

IDEA-Research/Grounded-Segment-Anything

roboflow/notebooks

roboflow/awesome-openai-vision-api-experiments

FoundationVision/GLEE

IDEA-Research/Grounding-DINO-1.5-API

SkalskiP/awesome-foundation-and-multimodal-models

segments-ai/panoptic-segment-anything

wanghao9610/OV-DINO

Charles-Xie/awesome-described-object-detection

FoundationVision/GenerateU

CVMI-Lab/CoDet

naver/shine

shikras/d-cube

rohit901/cooperative-foundational-models

lorebianchi98/FG-OVD

ibaiGorordo/ONNX-YOLO-World-Open-Vocabulary-Object-Detection

hpc203/GroundingDINO-onnxrun

om-ai-lab/OVDEval

jaychempan/LAE-DINO

wusize/CLIM

mala-lab/SIC-CADS

lorebianchi98/FG-CLIP

lartpang/OVCamo

CUHK-AIM-Group/CLIFF

hpc203/Open-Vocabulary-Object-Detection-opencv-onnxrun

EasyWalk-PRIN/OpenNav