open-vocabulary

There are 32 repositories under the open-vocabulary topic.

  • om-ai-lab/OmDet

    Real-time and accurate open-vocabulary end-to-end object detection

    Language: Python
  • NVlabs/ODISE

    Official PyTorch implementation of ODISE: Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models [CVPR 2023 Highlight]

    Language: Python
  • jianzongwu/Awesome-Open-Vocabulary

    (TPAMI 2024) A Survey on Open Vocabulary Learning

  • ok-robot/ok-robot

    An open, modular framework for zero-shot, language conditioned pick-and-drop tasks in arbitrary homes.

    Language: Python
  • xmed-lab/CLIP_Surgery

    CLIP Surgery for Better Explainability with Enhancement in Open-Vocabulary Tasks

    Language: Jupyter Notebook
  • witnessai/Awesome-Open-Vocabulary-Object-Detection

    A curated list of papers, datasets and resources pertaining to open vocabulary object detection.

  • CVMI-Lab/PLA

    (CVPR 2023) PLA: Language-Driven Open-Vocabulary 3D Scene Understanding & (CVPR2024) RegionPLC: Regional Point-Language Contrastive Learning for Open-World 3D Scene Understanding

    Language: Python
  • clin1223/VLDet

    [ICLR 2023] PyTorch implementation of VLDet (https://arxiv.org/abs/2211.14843)

    Language: Python
  • wusize/ovdet

    [CVPR2023] Code Release of Aligning Bag of Regions for Open-Vocabulary Object Detection

    Language: Python
  • yangcaoai/CoDA_NeurIPS2023

    Official code for NeurIPS2023 paper: CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object Detection

    Language: Jupyter Notebook
  • wusize/CLIPSelf

    [ICLR2024 Spotlight] Code Release of CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction

    Language: Python
  • ngthanhtin/owlvit_segment_anything

    Combining OwlViT with Segment Anything - Open-vocabulary Detection and Segmentation (Text-conditioned, and Image-conditioned)

    Language: Jupyter Notebook
  • hovsg/HOV-SG

    [RSS2024] Official implementation of "Hierarchical Open-Vocabulary 3D Scene Graphs for Language-Grounded Robot Navigation"

    Language: Python
  • FoundationVision/GenerateU

    [CVPR2024] Generative Region-Language Pretraining for Open-Ended Object Detection

    Language: Python
  • sunanhe/MKT

    Official implementation of "Open-Vocabulary Multi-Label Classification via Multi-Modal Knowledge Transfer".

    Language: Python
  • Surrey-UP-Lab/RegionSpot

    Recognize Any Regions

    Language: Python
  • CVMI-Lab/CoDet

    (NeurIPS2023) CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection

    Language: Python
  • HKUST-LongGroup/Awesome-Open-Vocabulary-Detection-and-Segmentation

    Awesome OVD-OVS - A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future

  • ArrowLuo/SegCLIP

    PyTorch implementation of ICML 2023 paper "SegCLIP: Patch Aggregation with Learnable Centers for Open-Vocabulary Semantic Segmentation"

    Language: Python
  • VinAIResearch/Open3DIS

    Open3DIS: Open-vocabulary 3D Instance Segmentation with 2D Mask Guidance (CVPR 2024)

    Language: Python
  • aminebdj/OpenYOLO3D

    Our OpenYOLO3D model achieves state-of-the-art performance in Open Vocabulary 3D Instance Segmentation on ScanNet200 and Replica datasets with up to ∼16x speedup compared to the best existing method in the literature.

    Language: Python
  • ajzhai/NeRF2Physics

    [CVPR 2024] Physical Property Understanding from Language-Embedded Feature Fields

    Language: Python
  • xuanlinli17/large_vlm_distillation_ood

    Distilling Large Vision-Language Model with Out-of-Distribution Generalizability (ICCV 2023)

    Language: Python
  • ldkong1205/OpenESS

    [CVPR 2024 Highlight] OpenESS: Event-Based Semantic Scene Understanding with Open Vocabularies

  • sinahmr/NACLIP

    PyTorch Implementation of NACLIP in "Pay Attention to Your Neighbours: Training-Free Open-Vocabulary Semantic Segmentation"

    Language: Python
  • ucas-vg/Sambor

    Sambor: Boosting Segment Anything Model Towards Open-Vocabulary Learning

  • codingonion/awesome-open-world-object-detection

    This repository lists some awesome public Open World object detection series projects.

  • lartpang/OVCamo

    (ECCV 2024) Open-Vocabulary Camouflaged Object Segmentation

    Language: Python
  • JunweiZheng93/OPS

    Official repository for paper "Open Panoramic Segmentation" (OPS) at ECCV 2024

    Language: Python
  • ruohaoguo/ovavss

    Official Implementation of "Open-Vocabulary Audio-Visual Semantic Segmentation" [ACM MM 2024 Oral].

    Language: Python
  • Fsoft-AIC/WAVER

    [ICASSP 2024 Oral] WAVER: Writing-Style Agnostic Text-Video Retrieval Via Distilling Vision-Language Models Through Open-Vocabulary Knowledge

    Language: Python