open-vocabulary-detection

There are 26 repositories under the open-vocabulary-detection topic.

  • IDEA-Research/Grounded-Segment-Anything

    Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything

    Language: Jupyter Notebook
  • roboflow/notebooks

    Examples and tutorials on using SOTA computer vision models and techniques. Learn everything from old-school ResNet, through YOLO and object-detection transformers like DETR, to the latest models like Grounding DINO and SAM.

    Language: Jupyter Notebook
  • roboflow/awesome-openai-vision-api-experiments

    Must-have resource for anyone who wants to experiment with and build on the OpenAI vision API 🔥

    Language: Python
  • FoundationVision/GLEE

    [CVPR2024 Highlight] GLEE: General Object Foundation Model for Images and Videos at Scale

    Language: Python
  • IDEA-Research/Grounding-DINO-1.5-API

    Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series

    Language: Python
  • SkalskiP/awesome-foundation-and-multimodal-models

    👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]

    Language: Python
  • segments-ai/panoptic-segment-anything

    Combining Segment Anything (SAM) with Grounding DINO for zero-shot object detection and CLIPSeg for zero-shot segmentation

    Language: Jupyter Notebook
  • wanghao9610/OV-DINO

    Official implementation of OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion

    Language: Python
  • Charles-Xie/awesome-described-object-detection

    A curated list of papers and resources related to Described Object Detection, Open-Vocabulary/Open-World Object Detection and Referring Expression Comprehension. Updated frequently; pull requests are welcome.

  • FoundationVision/GenerateU

    [CVPR2024] Generative Region-Language Pretraining for Open-Ended Object Detection

    Language: Python
  • CVMI-Lab/CoDet

    (NeurIPS2023) CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection

    Language: Python
  • naver/shine

    [CVPR'24 Highlight] SHiNe: Semantic Hierarchy Nexus for Open-vocabulary Object Detection

    Language: Python
  • shikras/d-cube

    A detection/segmentation dataset with labels characterized by intricate and flexible expressions. "Described Object Detection: Liberating Object Detection with Flexible Expressions" (NeurIPS 2023).

    Language: Python
  • rohit901/cooperative-foundational-models

    [WACV 2025] Official code for our paper "Enhancing Novel Object Detection via Cooperative Foundational Models"

    Language: Python
  • lorebianchi98/FG-OVD

    [CVPR2024 Highlight] Official repository of the paper "The devil is in the fine-grained details: Evaluating open-vocabulary object detectors for fine-grained understanding."

    Language: Python
  • ibaiGorordo/ONNX-YOLO-World-Open-Vocabulary-Object-Detection

    Python scripts performing Open Vocabulary Object Detection using the YOLO-World model in ONNX.

    Language: Python
  • hpc203/GroundingDINO-onnxrun

    Deploys GroundingDINO open-world object detection with onnxruntime; includes both C++ and Python versions of the program.

    Language: Python
  • om-ai-lab/OVDEval

    A Comprehensive Evaluation Benchmark for Open-Vocabulary Detection (AAAI 2024)

    Language: Python
  • jaychempan/LAE-DINO

    🦕 [AAAI'25] Official Code for "Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community"

  • wusize/CLIM

    [AAAI2024] Code Release of CLIM: Contrastive Language-Image Mosaic for Region Representation

    Language: Python
  • mala-lab/SIC-CADS

    Code Implementation of "Simple Image-level Classification Improves Open-vocabulary Object Detection" (AAAI'24)

    Language: Python
  • lorebianchi98/FG-CLIP

    [CBMI2024 Best Paper] Official repository of the paper "Is CLIP the main roadblock for fine-grained open-world perception?".

    Language: Jupyter Notebook
  • lartpang/OVCamo

    (ECCV 2024) Open-Vocabulary Camouflaged Object Segmentation

    Language: Python
  • CUHK-AIM-Group/CLIFF

    [ECCV' 24] CLIFF: Continual Latent Diffusion for Open-Vocabulary Object Detection

    Language: Python
  • hpc203/Open-Vocabulary-Object-Detection-opencv-onnxrun

    Deploys open-domain object detection with OpenCV and onnxruntime; includes both C++ and Python versions of the program.

    Language: C++
  • EasyWalk-PRIN/OpenNav

    Official code for OpenNav: Efficient Open Vocabulary 3D Object Detection for Smart Wheelchair Navigation - ACVR Workshop at ECCV'24

    Language: Python