XuDongHecs's Stars
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
THU-MIG/yolov10
YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]
naver/dust3r
DUSt3R: Geometric 3D Vision Made Easy
lyuwenyu/RT-DETR
[CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥 🔥 🔥
IDEA-Research/T-Rex
[ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy
Thinklab-SJTU/Awesome-LLM4AD
A curated list of awesome LLM for Autonomous Driving resources (continually updated)
dcharatan/flowmap
[3DV 2025] Code for "FlowMap: High-Quality Camera Poses, Intrinsics, and Depth via Gradient Descent" by Cameron Smith*, David Charatan*, Ayush Tewari, and Vincent Sitzmann
IDEA-Research/Grounding-DINO-1.5-API
Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series
henry123-boy/SpaTracker
[CVPR 2024 Highlight] Official PyTorch implementation of SpatialTracker: Tracking Any 2D Pixels in 3D Space
Jeff-LiangF/streamv2v
Official PyTorch implementation of StreamV2V.
Hujiazeng/Vach
Real-time streaming talking head
nnanhuang/S3Gaussian
Official Implementation of Self-Supervised Street Gaussians for Autonomous Driving
swc-17/SparseDrive
SparseDrive: End-to-End Autonomous Driving via Sparse Scene Representation
UX-Decoder/DINOv
[CVPR 2024] Official implementation of the paper "Visual In-context Learning"
wzzheng/OccWorld
[ECCV 2024] 3D World Model for Autonomous Driving
LightwheelAI/street-gaussians-ns
Unofficial implementation of "Street Gaussians: Modeling Dynamic Urban Scenes with Gaussian Splatting", ECCV2024.
BraveGroup/Drive-WM
[CVPR 2024] A world model for autonomous driving.
huang-yh/SelfOcc
[CVPR 2024] SelfOcc: Self-Supervised Vision-Based 3D Occupancy Prediction
OpenDriveLab/ViDAR
[CVPR 2024 Highlight] Visual Point Cloud Forecasting
OpenDriveLab/LaneSegNet
[ICLR 2024] Map Learning with Lane Segment for Autonomous Driving
PJLab-ADG/DriveArena
DriveArena: A Closed-loop Generative Simulation Platform for Autonomous Driving
jhaoshao/ChronoDepth
ChronoDepth: Learning Temporally Consistent Video Depth from Video Diffusion Priors
Pointcept/OpenIns3D
[ECCV'24] OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance Segmentation
wzzheng/PointOcc
Efficient Point-based 3D Semantic Occupancy Prediction
HorizonRobotics/GUMP
Generative model for Unified Motion Planning tasks
NVIDIA/nvImageCodec
A nvImageCodec library of GPU- and CPU-accelerated codecs featuring a unified interface
NVlabs/DQTrack
Official PyTorch implementation of End-to-end 3D Tracking with Decoupled Queries [ICCV 2023]
PeidongLi/SSR
Robertwyq/Object-Affinity
[TPAMI 2023] Object Affinity Learning: Towards Annotation-free Instance Segmentation