Abyssaledge's Stars
facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
threestudio-project/threestudio
A unified framework for 3D content generation.
dvlab-research/LongLoRA
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
Pointcept/Pointcept
Pointcept: a codebase for point cloud perception research. Latest works: PTv3 (CVPR'24 Oral), PPT (CVPR'24), OA-CNNs (CVPR'24), MSC (CVPR'23)
Pointcept/PointTransformerV3
[CVPR'24 Oral] Official repository of Point Transformer V3 (PTv3)
facebookresearch/omni3d
Code release for "Omni3D: A Large Benchmark and Model for 3D Object Detection in the Wild"
mit-han-lab/fastcomposer
[IJCV] FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention
dvlab-research/3D-Box-Segment-Anything
We extend Segment Anything to 3D perception by combining it with VoxelNeXt.
ZiyuGuo99/Point-Bind_Point-LLM
Align 3D Point Cloud with Multi-modalities for Large Language Models
Haiyang-W/DSVT
[CVPR2023] Official Implementation of "DSVT: Dynamic Sparse Voxel Transformer with Rotated Sets"
PJLab-ADG/DetZero
[ICCV 2023] DetZero: Rethinking Offboard 3D Object Detection with Long-term Sequential Point Clouds
Haiyang-W/GiT
[ECCV2024 Oral] Official Implementation of "GiT: Towards Generalist Vision Transformer through Universal Language Interface"
BraveGroup/Drive-WM
[CVPR 2024] A world model for autonomous driving.
fudan-zvg/GSS
[CVPR 2023] Official repository of Generative Semantic Segmentation
MarSaKi/ETPNav
[TPAMI 2024] Official repo of "ETPNav: Evolving Topological Planning for Vision-Language Navigation in Continuous Environments"
YuxueYang1204/TrimGS
Trim 3D Gaussian Splatting for Accurate Geometry Representation
MarSaKi/VLN-BEVBert
[ICCV 2023] Official repo of "BEVBert: Multimodal Map Pre-training for Language-guided Navigation"
tusen-ai/Anchor3DLane
Official PyTorch implementation for the paper "Anchor3DLane: Learning to Regress 3D Anchors for Monocular 3D Lane Detection", accepted by CVPR 2023
Robertwyq/PanoOcc
[CVPR 2024] PanoOcc: Unified Occupancy Representation for Camera-based 3D Panoptic Segmentation
zhanggang001/HEDNet
HEDNet (NeurIPS 2023) & SAFDNet (CVPR 2024 Oral)
ZiYang-xie/MV-Map
Official implementation of MV-Map: Offboard HD-Map Generation with Multi-view Consistency
BraveGroup/SheetCopilot
We release a general framework for prompting LLMs to manipulate software in a closed-loop manner.
BraveGroup/FullySparseFusion
Fully Sparse Fusion for 3D Object Detection
BraveGroup/LAW
Enhancing End-to-End Autonomous Driving with Latent World Model
skyhehe123/ScatterFormer
ScatterFormer: Efficient Voxel Transformer with Scattered Linear Attention (ECCV 2024)
gwenzhang/Voxel-Mamba
[NeurIPS'24] Voxel Mamba: Group-Free State Space Models for Point Cloud based 3D Object Detection
Tsinghua-MARS-Lab/GeoMAE
This is the official implementation of the paper - GeoMAE: Masked Geometric Target Prediction for Self-supervised Point Cloud Pre-Training
StiphyJay/LiDAR-PTQ
ICLR2024: LiDAR-PTQ: Post-Training Quantization for Point Cloud 3D Object Detection.
BraveGroup/PointSAM-for-MixSup
Codes for ICLR 2024: "MixSup: Mixed-grained Supervision for Label-efficient LiDAR-based 3D Object Detection"
Abyssaledge/ImmortalTracker-for-CTRL