RoyYang0714's Stars
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
graphdeco-inria/gaussian-splatting
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
mlfoundations/open_clip
An open source implementation of CLIP.
LiheYoung/Depth-Anything
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
magicleap/SuperGluePretrainedNetwork
SuperGlue: Learning Feature Matching with Graph Neural Networks (CVPR 2020, Oral)
mcordts/cityscapesScripts
README and scripts for the Cityscapes Dataset
IDEA-Research/detrex
detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.
ytongbai/LVM
YvanYin/Metric3D
The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Foundation Model..."
OpenDriveLab/Birds-eye-view-Perception
[IEEE T-PAMI] Awesome BEV perception research and cookbook for audiences of all levels in autonomous driving
IDEA-Research/MaskDINO
[CVPR 2023] Official implementation of the paper "Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation"
FoundationVision/GLEE
[CVPR2024 Highlight] GLEE: General Object Foundation Model for Images and Videos at Scale
facebookresearch/CutLER
Code release for "Cut and Learn for Unsupervised Object Detection and Instance Segmentation" and "VideoCutLER: Surprisingly Simple Unsupervised Video Instance Segmentation"
UMass-Foundation-Model/3D-LLM
Code for 3D-LLM: Injecting the 3D World into Large Language Models
megvii-research/MOTRv2
[CVPR2023] MOTRv2: Bootstrapping End-to-End Multi-Object Tracking by Pretrained Object Detectors
MCG-NJU/SparseBEV
[ICCV 2023] SparseBEV: High-Performance Sparse 3D Object Detection from Multi-Camera Videos
Ghostish/Open3DSOT
Open source library for Single Object Tracking in point clouds.
hht1996ok/EA-LSS
EA-LSS: Edge-aware Lift-splat-shot Framework for 3D BEV Object Detection
Sense-X/HoP
[ICCV 2023] Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction
jkulhanek/nerfbaselines
Reproducible evaluation of NeRF methods
SysCV/r3d3
MCG-NJU/MeMOTR
[ICCV 2023] MeMOTR: Long-Term Memory-Augmented Transformer for Multi-Object Tracking
scannetpp/scannetpp
autonomousvision/murf
[CVPR'24] MuRF: Multi-Baseline Radiance Fields
NaiyuGao/PanopticDepth
PanopticDepth: A Unified Framework for Depth-aware Panoptic Segmentation (CVPR2022)
chinhsuanwu/ifusion
Official PyTorch implementation of "iFusion: Inverting Diffusion for Pose-Free Reconstruction from Sparse Views"
drilistbox/3DPPE
mincheoree/BEVMap
[WACV 2024] This is the official implementation of BEVMap, a map-aware BEV modeling framework for multiview-camera detection
jwh97nn/DeepDPS
[ICCV 2023] Towards Deeply Unified Depth-aware Panoptic Segmentation with Bi-directional Guidance Learning