Pinned Repositories
awesome-described-object-detection
A curated list of papers and resources related to Described Object Detection, Open-Vocabulary/Open-World Object Detection and Referring Expression Comprehension. Updated frequently; pull requests are welcome.
Bongard-HOI
[CVPR 2022 (oral)] Bongard-HOI for benchmarking few-shot visual reasoning
CaCao
This is the official repository for the paper "Visually-Prompted Language Model for Fine-Grained Scene Graph Generation in an Open World" (Accepted by ICCV 2023)
classify_by_description_release
CoDet
(NeurIPS 2023) CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection
CQL
Code for our paper "Category Query Learning for Human-Object Interaction Classification" (CVPR 2023)
detrex
detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.
DisAlign
Implementation of "Distribution Alignment: A Unified Framework for Long-tail Visual Recognition" (CVPR 2021)
Focus-DETR
[ICCV 2023] Official implementation of the paper "Less is More: Focus Attention for Efficient DETR"
GroundingDINO
Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
nari95park's Repositories
nari95park/awesome-described-object-detection
A curated list of papers and resources related to Described Object Detection, Open-Vocabulary/Open-World Object Detection and Referring Expression Comprehension. Updated frequently; pull requests are welcome.
nari95park/Bongard-HOI
[CVPR 2022 (oral)] Bongard-HOI for benchmarking few-shot visual reasoning
nari95park/CaCao
This is the official repository for the paper "Visually-Prompted Language Model for Fine-Grained Scene Graph Generation in an Open World" (Accepted by ICCV 2023)
nari95park/classify_by_description_release
nari95park/CoDet
(NeurIPS 2023) CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection
nari95park/CQL
Code for our paper "Category Query Learning for Human-Object Interaction Classification" (CVPR 2023)
nari95park/detrex
detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.
nari95park/DisAlign
Implementation of "Distribution Alignment: A Unified Framework for Long-tail Visual Recognition" (CVPR 2021)
nari95park/Focus-DETR
[ICCV 2023] Official implementation of the paper "Less is More: Focus Attention for Efficient DETR"
nari95park/GroundingDINO
Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
nari95park/ho-rcnn
Code for reproducing the results in "Learning to Detect Human-Object Interactions"
nari95park/LLM-groundedDiffusion
LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models (LLM-grounded Diffusion: LMD)
nari95park/nari95park
Config files for my GitHub profile.
nari95park/NMP
Code for Neural Message Passing for Visual Relationship Detection
nari95park/OpenGait
A flexible and extensible framework for gait recognition. With OpenGait, you can focus on designing your own models and easily compare them with state-of-the-art methods.
nari95park/spot
[CVPR 2024 Highlight] 🐶 SPOT: Self-Training with Patch-Order Permutation for Object-Centric Learning with Autoregressive Transformers
nari95park/Transformers-Tutorials
This repository contains demos I made with the Transformers library by HuggingFace.
nari95park/veto
Vision Relation Transformer for Unbiased Scene Graph Generation (ICCV 2023)
nari95park/VS3_CVPR23
Code for CVPR23 paper: Learning to Generate Language-supervised and Open-vocabulary Scene Graph using Pre-trained Visual-Semantic Space