changhaonan
LLM/VLM Robotics, Video generation Robotics, Scalable Robot-Learning, Non-rigid Reconstruction.
Rutgers University, NJ, USA
changhaonan's Stars
lllyasviel/ControlNet
Let us control diffusion models!
albumentations-team/albumentations
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
labelmeai/labelme
Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point and image-level flag annotation).
CompVis/latent-diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
IDEA-Research/GroundingDINO
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
facebookresearch/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
THUDM/CogVLM
A state-of-the-art open visual language model | Multimodal pre-trained model
OpenGVLab/LLaMA-Adapter
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
facebookresearch/co-tracker
CoTracker is a model for tracking any point (pixel) on a video.
yanx27/Pointnet_Pointnet2_pytorch
PointNet and PointNet++ implemented in PyTorch (pure Python), on ModelNet, ShapeNet and S3DIS.
facebookresearch/jepa
PyTorch code and models for V-JEPA self-supervised learning from video.
hustvl/4DGaussians
[CVPR 2024] 4D Gaussian Splatting for Real-Time Dynamic Scene Rendering
JonathonLuiten/Dynamic3DGaussians
autonomousvision/sdfstudio
A Unified Framework for Surface Reconstruction
NVlabs/FoundationPose
[CVPR 2024 Highlight] FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects
Pointcept/PointTransformerV3
[CVPR'24 Oral] Official repository of Point Transformer V3 (PTv3)
real-stanford/universal_manipulation_interface
Universal Manipulation Interface: In-The-Wild Robot Teaching Without In-The-Wild Robots
drprojects/superpoint_transformer
Official PyTorch implementation of Superpoint Transformer introduced in [ICCV'23] "Efficient 3D Semantic Segmentation with Superpoint Transformer" and SuperCluster introduced in [3DV'24 Oral] "Scalable 3D Panoptic Segmentation As Superpoint Graph Clustering"
robodhruv/visualnav-transformer
Official code and checkpoint release for mobile robot foundation models: GNM, ViNT, and NoMaD.
OpenRobotLab/PointLLM
[ECCV 2024 Best Paper Candidate] PointLLM: Empowering Large Language Models to Understand Point Clouds
JonasSchult/Mask3D
Mask3D predicts accurate 3D semantic instances achieving state-of-the-art on ScanNet, ScanNet200, S3DIS and STPLS3D.
sail-sg/MDT
Masked Diffusion Transformer is the SOTA for image synthesis. (ICCV 2023)
RoboFlamingo/RoboFlamingo
Code for RoboFlamingo
oneformer3d/oneformer3d
[CVPR2024] OneFormer3D: One Transformer for Unified Point Cloud Segmentation
DYZhang09/SAM3D
[SCIS] SAM3D: Zero-Shot 3D Object Detection via Segment Anything Model
lixiny/manotorch
MANO hand model in PyTorch (anatomy consistent, anchors, etc)
kumuji/volumentations
Augmentation package for 3D data based on albumentations
changhaonan/LGMCTS-D
changhaonan/BundleTrack
A fork of BundleTrack that replaces the original 2D tracker with pytracking and a superpoints keypoint detector.
changhaonan/KinectFusion-python