Pinned Repositories
3D-VisTA
Official implementation of the ICCV 2023 paper "3D-VisTA: Pre-trained Transformer for 3D Vision and Text Alignment"
afford-motion
Official implementation of the CVPR 2024 highlight paper "Move as You Say, Interact as You Can: Language-guided Human Motion Generation with Scene Affordance"
VLN-VER
[CVPR 2024] Volumetric Environment Representation for Vision-Language Navigation
occ-flow
[CoRL 2024] Let Occ Flow: Self-Supervised 3D Occupancy Flow Prediction
Cam4DOcc
[CVPR 2024] Cam4DOcc: Benchmark for Camera-Only 4D Occupancy Forecasting in Autonomous Driving Applications
HOI-Diff
HOI-Diff: Text-Driven Synthesis of 3D Human-Object Interactions using Diffusion Models, arXiv 2023
FKVSWIN_depth
OccSora
OccSora: 4D Occupancy Generation Models as World Simulators for Autonomous Driving
D2-World
Official PyTorch implementation of D^2-World, the second-place solution in the CVPR 2024 Predictive World Model Challenge.
Chat-Scene
Code for "Chat-Scene: Bridging 3D Scene and Large Language Models with Object Identifiers" (NeurIPS 2024)