skskgrowl's Stars
NotACracker/COTR
[CVPR24] COTR: Compact Occupancy TRansformer for Vision-based 3D Occupancy Prediction
keithAND2020/awesome-Occupancy-research
Papers on occupancy prediction, including monocular and multi-view approaches, in autonomous driving scenarios
feizc/DiS
Scalable Diffusion Models with State Space Backbone
weiyithu/SurroundOcc
[ICCV 2023] SurroundOcc: Multi-camera 3D Occupancy Prediction for Autonomous Driving
ldtho/DifFUSER
DifFUSER: Diffusion Model for Robust Multi-Sensor Fusion in 3D Object Detection and BEV Segmentation
duanyiqun/DiffusionDepth
[ECCV 2024] PyTorch implementation of DiffusionDepth, a diffusion denoising approach to monocular depth estimation
dome272/MaskGIT-pytorch
Pytorch implementation of MaskGIT: Masked Generative Image Transformer (https://arxiv.org/pdf/2202.04200.pdf)
Rorisis/Co-Occ
[IEEE RA-L] Co-Occ: Coupling Explicit Feature Fusion with Volume Rendering Regularization for Multi-Modal 3D Semantic Occupancy Prediction
Event-AHU/Mamba_State_Space_Model_Paper_List
[Mamba-Survey-2024] Paper list for State Space Models/Mamba and their applications
autodriving-heart/Awesome-Autonomous-Driving
awesome-autonomous-driving
BarqueroGerman/FlowMDM
[CVPR 2024] Official Implementation of "Seamless Human Motion Composition with Blended Positional Encodings".
OpenRobotLab/UniHSI
[ICLR 2024 Spotlight] Unified Human-Scene Interaction via Prompted Chain-of-Contacts
Szy-Young/ActFormer
🔥 ActFormer in PyTorch (ICCV 2023)
OpenMotionLab/MotionGPT
[NeurIPS 2023] MotionGPT: Human Motion as a Foreign Language, a unified motion-language generation model using LLMs
scenediffuser/Scene-Diffuser
Official implementation of CVPR23 paper "Diffusion-based Generation, Optimization, and Planning in 3D Scenes"
afford-motion/afford-motion
Official implementation of CVPR24 highlight paper "Move as You Say, Interact as You Can: Language-guided Human Motion Generation with Scene Affordance"
UrbanArchitect/UrbanArchitect
The official repository of our paper: "Urban Architect: Steerable 3D Urban Scene Generation with Layout Prior"
52CV/CVPR-2024-Papers
UMass-Foundation-Model/3D-VLA
[ICML 2024] 3D-VLA: A 3D Vision-Language-Action Generative World Model
embodied-generalist/embodied-generalist
[ICML 2024] Official code repository for 3D embodied generalist agent LEO
allenai/Holodeck
[CVPR 2024] Language Guided Generation of 3D Embodied AI Environments.
hzxie/CityDreamer
The official implementation of "CityDreamer: Compositional Generative Model of Unbounded 3D Cities". (Xie et al., CVPR 2024)
3dlg-hcvc/M3DRef-CLIP
[ICCV 2023] Multi3DRefer: Grounding Text Description to Multiple 3D Objects
Open3DA/LL3DA
[CVPR 2024] "LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning"; an interactive Large Language 3D Assistant.
scene-verse/SceneVerse
Official implementation of ECCV24 paper "SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding"
OpenRobotLab/EmbodiedScan
[CVPR 2024 & NeurIPS 2024] EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI
MzeroMiko/VMamba
VMamba: Visual State Space Models; code is based on Mamba
hustvl/Vim
[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
ATR-DBI/CityRefer
ZhanYang-nwpu/Mono3DVG
[AAAI 2024] Mono3DVG: 3D Visual Grounding in Monocular Images