luoao-kddi

luoao-kddi's Stars

filaPro/unidet3d
UniDet3D: Multi-dataset Indoor 3D Object Detection
Language:Python341
weishuaiSong/tr3d
Getting tr3d results in SUNRGB
Language:Python1
Daner-Wang/VTC-LFC
Language:Python233
ridgerchu/matmulfreellm
Implementation for MatMul-free LM.
Language:Python2.9k179
SamsungLabs/tr3d
[ICIP2023] TR3D: Towards Real-Time Indoor 3D Object Detection
Language:Python1469
Asterisci/Point-GCC
[ACMMM 2024] Point-GCC: Universal Self-supervised 3D Scene Pre-training via Geometry-Color Contrast
Language:Python272
JonasSchult/Mask3D
Mask3D predicts accurate 3D semantic instances achieving state-of-the-art on ScanNet, ScanNet200, S3DIS and STPLS3D.
Language:Python535107
igizuxo/TSC-PCAC
Point Cloud Attribute Compression with Sparse Convolution and Voxel Transformer
Language:Python102
lslrh/DMA
Official code of DMA: Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding, ECCV 2024
Language:Python23
jmliu206/LIC_TCM
Language:Python14422
ymxlzgy/commonscenes
[NeurIPS 2023] The repo of CommonScenes, a scene generation method powered by the diffusion model.
Language:Python744
xinyu1205/recognize-anything
Open-source and strong foundation image recognition models.
Language:Jupyter Notebook2.8k271
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Language:Python35k4.1k
OpenGVLab/LAMM
[NeurIPS 2023 Datasets and Benchmarks Track] LAMM: Multi-Modal Large Language Models and Applications as AI Agents
Language:Python29716
WaldJohannaU/3RScan
3RScan Toolkit
Language:C++18619
OpenM3D/M3DBench
[ECCV 2024] M3DBench introduces a comprehensive 3D instruction-following dataset with support for interleaved multi-modal prompts.
Language:Python552
daveredrum/Scan2Cap
[CVPR 2021] Scan2Cap: Context-aware Dense Captioning in RGB-D Scans
Language:Python9915
OpenRobotLab/EmbodiedScan
[CVPR 2024 & NeurIPS 2024] EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI
Language:Python46234
nelhage/reptyr
Reparent a running program to a new terminal
Language:C5.8k215
robot-pesg/BotanicGarden
BotanicGarden: A high-quality dataset for robot navigation in unstructured natural environments
Language:Jupyter Notebook16514
TangYuan96/MiniGPT-3D
[MM 2024] [Need a RTX 3090] MiniGPT-3D: Efficiently Aligning 3D Point Clouds with Large Language Models using 2D Priors
Language:Python625
KuanchihHuang/Reason3D
Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model
421
JieyuZ2/TaskMeAnything
[NeurIPS 2024] A task generation and model evaluation system for multimodal language models.
Language:Python543
filaPro/oneformer3d
[CVPR2024] OneFormer3D: One Transformer for Unified Point Cloud Segmentation
Language:Python30727
facebookresearch/detectron2
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
Language:Python30.1k7.4k
facebookresearch/Mask2Former
Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"
Language:Python2.5k379
V-DETR/V-DETR
[ICLR 2024] This is the official code of the paper "V-DETR: DETR with Vertex Relative Position Encoding for 3D Object Detection"
Language:Python883
embodied-generalist/embodied-generalist
[ICML 2024] Official code repository for 3D embodied generalist agent LEO
Language:Python34630
ZrrSkywalker/Point-NN
[CVPR 2023] Parameter is Not All You Need: Starting from Non-Parametric Networks for 3D Point Cloud Analysis
Language:Python47850
luost26/diffusion-point-cloud
:thought_balloon: Diffusion Probabilistic Models for 3D Point Cloud Generation (CVPR 2021)
Language:Python63590