luoao-kddi's Stars
filaPro/unidet3d
UniDet3D: Multi-dataset Indoor 3D Object Detection
weishuaiSong/tr3d
Getting TR3D results on SUN RGB-D
Daner-Wang/VTC-LFC
ridgerchu/matmulfreellm
Implementation for MatMul-free LM.
SamsungLabs/tr3d
[ICIP 2023] TR3D: Towards Real-Time Indoor 3D Object Detection
Asterisci/Point-GCC
[ACM MM 2024] Point-GCC: Universal Self-supervised 3D Scene Pre-training via Geometry-Color Contrast
JonasSchult/Mask3D
Mask3D predicts accurate 3D semantic instances achieving state-of-the-art on ScanNet, ScanNet200, S3DIS and STPLS3D.
igizuxo/TSC-PCAC
Point Cloud Attribute Compression with Sparse Convolution and Voxel Transformer
lslrh/DMA
Official code of DMA: Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding, ECCV 2024
jmliu206/LIC_TCM
ymxlzgy/commonscenes
[NeurIPS 2023] The repo of CommonScenes, a scene generation method powered by the diffusion model.
xinyu1205/recognize-anything
Open-source and strong foundation image recognition models.
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
OpenGVLab/LAMM
[NeurIPS 2023 Datasets and Benchmarks Track] LAMM: Multi-Modal Large Language Models and Applications as AI Agents
WaldJohannaU/3RScan
3RScan Toolkit
OpenM3D/M3DBench
[ECCV 2024] M3DBench introduces a comprehensive 3D instruction-following dataset with support for interleaved multi-modal prompts.
daveredrum/Scan2Cap
[CVPR 2021] Scan2Cap: Context-aware Dense Captioning in RGB-D Scans
OpenRobotLab/EmbodiedScan
[CVPR 2024 & NeurIPS 2024] EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI
nelhage/reptyr
Reparent a running program to a new terminal
robot-pesg/BotanicGarden
BotanicGarden: A high-quality dataset for robot navigation in unstructured natural environments
TangYuan96/MiniGPT-3D
[MM 2024] [Needs an RTX 3090] MiniGPT-3D: Efficiently Aligning 3D Point Clouds with Large Language Models using 2D Priors
KuanchihHuang/Reason3D
Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model
JieyuZ2/TaskMeAnything
[NeurIPS 2024] A task generation and model evaluation system for multimodal language models.
filaPro/oneformer3d
[CVPR 2024] OneFormer3D: One Transformer for Unified Point Cloud Segmentation
facebookresearch/detectron2
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
facebookresearch/Mask2Former
Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"
V-DETR/V-DETR
[ICLR 2024] This is the official code of the paper "V-DETR: DETR with Vertex Relative Position Encoding for 3D Object Detection"
embodied-generalist/embodied-generalist
[ICML 2024] Official code repository for 3D embodied generalist agent LEO
ZrrSkywalker/Point-NN
[CVPR 2023] Parameter is Not All You Need: Starting from Non-Parametric Networks for 3D Point Cloud Analysis
luost26/diffusion-point-cloud
Diffusion Probabilistic Models for 3D Point Cloud Generation (CVPR 2021)