open-vocabulary-detection
There are 26 repositories under open-vocabulary-detection topic.
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
roboflow/notebooks
Examples and tutorials on using SOTA computer vision models and techniques. Learn everything from old-school ResNet, through YOLO and object-detection transformers like DETR, to the latest models like Grounding DINO and SAM.
roboflow/awesome-openai-vision-api-experiments
Must-have resource for anyone who wants to experiment with and build on the OpenAI vision API 🔥
FoundationVision/GLEE
[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale
IDEA-Research/Grounding-DINO-1.5-API
Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series
SkalskiP/awesome-foundation-and-multimodal-models
👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]
segments-ai/panoptic-segment-anything
Combining Segment Anything (SAM) with Grounded DINO for zero-shot object detection and CLIPSeg for zero-shot segmentation
wanghao9610/OV-DINO
Official implementation of OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion
Charles-Xie/awesome-described-object-detection
A curated list of papers and resources related to Described Object Detection, Open-Vocabulary/Open-World Object Detection and Referring Expression Comprehension. Updated frequently and pull requests welcomed.
FoundationVision/GenerateU
[CVPR2024] Generative Region-Language Pretraining for Open-Ended Object Detection
CVMI-Lab/CoDet
(NeurIPS2023) CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection
naver/shine
[CVPR'24 Highlight] SHiNe: Semantic Hierarchy Nexus for Open-vocabulary Object Detection
shikras/d-cube
A detection/segmentation dataset with labels characterized by intricate and flexible expressions. "Described Object Detection: Liberating Object Detection with Flexible Expressions" (NeurIPS 2023).
rohit901/cooperative-foundational-models
[WACV 2025] Official code for our paper "Enhancing Novel Object Detection via Cooperative Foundational Models"
lorebianchi98/FG-OVD
[CVPR2024 Highlight] Official repository of the paper "The devil is in the fine-grained details: Evaluating open-vocabulary object detectors for fine-grained understanding."
ibaiGorordo/ONNX-YOLO-World-Open-Vocabulary-Object-Detection
Python scripts performing Open Vocabulary Object Detection using the YOLO-World model in ONNX.
hpc203/GroundingDINO-onnxrun
使用onnxruntime部署GroundingDINO开放世界目标检测,包含C++和Python两个版本的程序
om-ai-lab/OVDEval
A Comprehensive Evaluation Benchmark for Open-Vocabulary Detection (AAAI 2024)
jaychempan/LAE-DINO
🦕 [AAAI'25] Official Code for “Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community"
wusize/CLIM
[AAAI2024] Code Release of CLIM: Contrastive Language-Image Mosaic for Region Representation
mala-lab/SIC-CADS
Code Implementation of "Simple Image-level Classification Improves Open-vocabulary Object Detection" (AAAI'24)
lorebianchi98/FG-CLIP
[CBMI2024 Best Paper] Official repository of the paper "Is CLIP the main roadblock for fine-grained open-world perception?".
lartpang/OVCamo
(ECCV 2024) Open-Vocabulary Camouflaged Object Segmentation
CUHK-AIM-Group/CLIFF
[ECCV' 24] CLIFF: Continual Latent Diffusion for Open-Vocabulary Object Detection
hpc203/Open-Vocabulary-Object-Detection-opencv-onnxrun
使用OpenCV+onnxruntime部署开放域目标检测,包含C++和Python两个版本的程序
EasyWalk-PRIN/OpenNav
Official code for the OpenNav: Efficient Open Vocabulary 3D Object Detection for Smart Wheelchair Navigation - ACVR Workshop at ECCV'24