Pinned Repositories
3D-Box-Segment-Anything
We extend Segment Anything to 3D perception by combining it with VoxelNeXt.
3D-RetinaNet
3D-RetinaNet a baseline models on ROAD dataset
AMENet
aot-benchmark
An efficient modular implementation of Associating Objects with Transformers for Video Object Segmentation in PyTorch
DCENet
Exploring Dynamic Context for Multi-path Trajectory Prediction
GATraj
Official PyTorch Implementation of "GATraj: A Graph- and Attention-based Multi-Agent Trajectory Prediction Model"
MCENET
QCNet
[CVPR 2023] Query-Centric Trajectory Prediction
Trajectory-Visualization
This project aims at visualizing observed and predicted trajectories with animation
trajectory_processing
haohao11's Repositories
haohao11/QCNet
[CVPR 2023] Query-Centric Trajectory Prediction
haohao11/3D-Box-Segment-Anything
We extend Segment Anything to 3D perception by combining it with VoxelNeXt.
haohao11/aot-benchmark
An efficient modular implementation of Associating Objects with Transformers for Video Object Segmentation in PyTorch
haohao11/ConditionalDETR
This repository is an official implementation of the ICCV 2021 paper "Conditional DETR for Fast Training Convergence". (https://arxiv.org/abs/2108.06152)
haohao11/DAB-DETR
[ICLR 2022] Official implementation of the paper "DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETR"
haohao11/DeepAccident
Code for the benchmark - DeepAccident: A Motion and Accident Prediction Benchmark for V2X Autonomous Driving.
haohao11/DeepSeek-V3
haohao11/Deformable-DETR
Deformable DETR: Deformable Transformers for End-to-End Object Detection.
haohao11/DriveLM
DriveLM: Driving with Graph Visual Question Answering
haohao11/DrivingWorld
Code for "DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT"
haohao11/FastSAM
Fast Segment Anything
haohao11/futr3d
Code for paper: FUTR3D: a unified sensor fusion framework for 3d detection
haohao11/GevBEV
haohao11/GPCIS_CVPR2023
haohao11/Grounded-Segment-Anything
Marrying Grounding DINO with Segment Anything & Stable Diffusion & Tag2Text & BLIP & Whisper & ChatBot - Automatically Detect , Segment and Generate Anything with Image, Text, and Audio Inputs
haohao11/H-Deformable-DETR
[CVPR2023] This is an official implementation of paper "DETRs with Hybrid Matching".
haohao11/l5kit
L5Kit - https://woven.toyota
haohao11/LAformer
Official PyTorch Implementation of "LAformer: Trajectory Prediction for Autonomous Driving with Lane-Aware Scene Constraints"
haohao11/MaskDINO
[CVPR 2023] Official implementation of the paper "Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation"
haohao11/mile
PyTorch code for the paper "Model-Based Imitation Learning for Urban Driving".
haohao11/MobileSAM
This is the offiicial code for Faster Segment Anything (MobileSAM) project that makes SAM lightweight
haohao11/PF-Track
Implementation of PF-Track
haohao11/Segment-and-Track-Anything
An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) for key-frame segmentation and Associating Objects with Transformers (AOT) for efficient tracking and propagation purposes.
haohao11/simple_bev
A Simple Baseline for BEV Perception
haohao11/Sparse4D
Sparse4D v1 & v2
haohao11/UniAD
[CVPR 2023 Award Candidate] Planning-oriented Autonomous Driving
haohao11/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
haohao11/Vim
Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
haohao11/ViP3D
haohao11/VoxelNeXt
VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking (CVPR 2023)