jzhzhang's Stars
AtsushiSakai/PythonRobotics
Python sample codes for robotics algorithms.
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
kornia/kornia
Geometric Computer Vision Library for Spatial AI
zhm-real/PathPlanning
Common used path planning algorithms with animations.
huggingface/lerobot
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
isaac-sim/IsaacGymEnvs
Isaac Gym Reinforcement Learning Environments
yunlong10/Awesome-LLMs-for-Video-Understanding
🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.
Xharlie/pointnerf
Point-NeRF: Point-based Neural Radiance Fields
sunset1995/DirectVoxGO
Direct voxel grid optimization for fast radiance field reconstruction.
facebookresearch/ToMe
A method to increase the speed and lower the memory footprint of existing vision transformers.
octo-models/octo
Octo is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.
NVlabs/curobo
CUDA Accelerated Robot Library
chenhsuanlin/bundle-adjusting-NeRF
BARF: Bundle-Adjusting Neural Radiance Fields 🤮 (ICCV 2021 oral)
dvlab-research/LLaMA-VID
LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models (ECCV 2024)
jzhzhang/ROSEFusion
[SIGGRAPH 2021] ROSEFusion is proposed to tackle the difficulties in fast-motion camera tracking using random optimization with depth information only.
robodhruv/drive-any-robot
Official code and checkpoint release for "GNM: A General Navigation Model to Drive Any Robot".
bytedance/GR-1
Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"
YicongHong/Thinking-VLN
Ideas and thoughts about the fascinating Vision-and-Language Navigation
GengzeZhou/NavGPT
[AAAI 2024] Official implementation of NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models
wz0919/ScaleVLN
[ICCV 2023 Oral]: Scaling Data Generation in Vision-and-Language Navigation
zd11024/NaviLLM
[CVPR 2024] The code for paper 'Towards Learning a Generalist Model for Embodied Navigation'
cshizhe/VLN-HAMT
Official implementation of History Aware Multimodal Transformer for Vision-and-Language Navigation (NeurIPS'21).
facebookresearch/spot-sim2real
Spot Sim2Real Infrastructure
PKU-EPIC/MaskClustering
[CVPR 24] MaskClustering: View Consensus based Mask Graph Clustering for Open-Vocabulary 3D Instance Segmentation
pearl-robot-lab/rlmmbp
Learning mobile manipulation behaviors through reinforcement learning
user432/gamma
yjtang249/MIPSFusion
[SIGGRAPH Asia 2023] MIPSFusion is a neural SLAM method based on multi-implicit-submap representation for scalable online RGB-D reconstruction.
rising-turtle/DUI_VIO
Depth uncertainty incorporated VIO
joannetruong/habitat-lab
A modular high-level library to train embodied AI agents across a variety of tasks, environments, and simulators.
LRLVEC/glframework
A light weight multi-window rendering and user interface framework for OpenGL and OpenXR applications