sankin97's Stars
PJLab-ADG/awesome-knowledge-driven-AD
A curated list of awesome knowledge-driven autonomous driving (continually updated)
waymo-research/waymax
A JAX-based simulator for autonomous driving research.
microsoft/SoM
Set-of-Mark Prompting for GPT-4V and LMMs
facebookresearch/votenet
Deep Hough Voting for 3D Object Detection in Point Clouds
jy0205/LaVIT
LaVIT: Empower the Large Language Model to Understand and Generate Visual Content
PointsCoder/GPT-Driver
Learning to Drive with GPT
PJLab-ADG/DiLu
[ICLR 2024] DiLu: A Knowledge-Driven Approach to Autonomous Driving with Large Language Models
baaivision/Uni3D
[ICLR'24 Spotlight] Uni3D: 3D Visual Representation from BAAI
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
wayveai/Driving-with-LLMs
PyTorch implementation for the paper "Driving with LLMs: Fusing Object-Level Vector Modality for Explainable Autonomous Driving"
facebookresearch/home-robot
Mobile manipulation research tools for roboticists
JeffWang987/DriveDreamer
[ECCV 2024] DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving
dvlab-research/Mask-Attention-Free-Transformer
Official Implementation for "Mask-Attention-Free Transformer for 3D Instance Segmentation"
CuiRuikai/Partial2Complete
[ICCV 2023] P2C: Self-Supervised Point Cloud Completion from Single Partial Clouds
quotation2520/PG-RCNN
Implementation of "PG-RCNN: Semantic Surface Point Generation for 3D Object Detection" (ICCV 2023)
ZrrSkywalker/MonoDETR
[ICCV 2023] The first DETR model for monocular 3D object detection with depth-guided transformer
MarSaKi/VLN-BEVBert
[ICCV 2023] Official repo of "BEVBert: Multimodal Map Pre-training for Language-guided Navigation"
OpenDriveLab/DriveLM
[ECCV 2024 Oral] DriveLM: Driving with Graph Visual Question Answering
Daniel-xsy/RoboBEV
RoboBEV: Towards Robust Bird's Eye View Perception under Common Corruption and Domain Shift
zhuoxiao-chen/ReDB-DA-3Ddet
[ICCV 2023] Revisiting Domain-Adaptive 3D Object Detection by Reliable, Diverse and Class-balanced Pseudo-Labeling
hustvl/VAD
[ICCV 2023] VAD: Vectorized Scene Representation for Efficient Autonomous Driving
InternLM/lagent
A lightweight framework for building LLM-based agents
A-suozhang/ada3d
Code for the ICCV 2023 paper "Ada3D: Exploiting the Spatial Redundancy with Adaptive Inference for Efficient 3D Object Detection"
Haiyang-W/UniTR
[ICCV2023] Official Implementation of "UniTR: A Unified and Efficient Multi-Modal Transformer for Bird’s-Eye-View Representation"
NVlabs/FocalFormer3D
Official PyTorch implementation of FocalFormer3D [ICCV 2023]
dvlab-research/LISA
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
zehuichen123/NoiseDet
[ICCV2023] NoiseDet: Learning from Noisy Data for Semi-Supervised 3D Object Detection
Victorwz/LongMem
Official implementation of our NeurIPS 2023 paper "Augmenting Language Models with Long-Term Memory".
yichen928/SparseFusion
[ICCV 2023] SparseFusion: Fusing Multi-Modal Sparse Representations for Multi-Sensor 3D Object Detection
qiantianwen/NuScenes-QA
[AAAI 2024] NuScenes-QA: A Multi-modal Visual Question Answering Benchmark for Autonomous Driving Scenario.