Yingdong-Hu's Stars
Zeyi-Lin/HivisionIDPhotos
⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。
facebookresearch/segment-anything-2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
ShiArthur03/ShiArthur03
rerun-io/rerun
Visualize streams of multimodal data. Free, fast, easy to use, and simple to integrate. Built in Rust.
cambrian-mllm/cambrian
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
karpathy/nano-llama31
nanoGPT style version of Llama 3.1
LTH14/mar
PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838
nerfstudio-project/viser
Web-based 3D visualization + Python
IDEA-Research/Grounding-DINO-1.5-API
Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series
huangwl18/ReKep
ReKep: Spatio-Temporal Reasoning of Relational Keypoint Constraints for Robotic Manipulation
unitreerobotics/avp_teleoperate
carlosferrazza/humanoid-bench
marek-simonik/record3d
Accompanying library for the Record3D iOS app (https://record3d.app/). Allows you to receive RGBD stream from iOS devices with TrueDepth camera(s).
1x-technologies/1xgpt
world modeling challenge for humanoid robots
kevinzakka/mink
Python inverse kinematics based on MuJoCo
apirrone/Open_Duck_Mini
Making a mini version of the BDX droid
MIT-SPARK/Khronos
Spatio-Temporal Metric-Semantic SLAM
dexsuite/dex-retargeting
real-stanford/umi-on-legs
UMI on Legs: Making Manipulation Policies Mobile with Manipulation-Centric Whole-body Controllers
haritheja-e/robot-utility-models
Robot Utility Models are trained on a diverse set of environments and objects, and then can be deployed in novel environments with novel objects without any further data or training.
yjy0625/equibot
Official implementation for paper "EquiBot: SIM(3)-Equivariant Diffusion Policy for Generalizable and Data Efficient Learning".
carlosferrazza/BodyTransformer
Body Transformer: Leveraging Robot Embodiment for Policy Learning
mihdalal/neuralmotionplanner
PyTorch Code for Neural MP: A Generalist Neural Motion Planner
MohitShridhar/genima
Official Code Repo for GENIMA
rerun-io/python-example-droid-dataset
Visualizing the DROID dataset using Rerun
real-stanford/maniwav
Official codebase of paper "ManiWAV: Learning Robot Manipulation from In-the-Wild Audio-Visual Data"
omarrayyann/MujocoAR
A MuJoCo plugin that enables the integration of ARKit data from a connected iOS device to control MuJoCo frames in real-time
ACETeleop/ACE_hardware
PyojinKim/ios_logger
Application for camera and sensor data logging (iOS)
eugeneteoh/greenaug
GreenAug: Green Screen Augmentation Enables Scene Generalisation in Robotic Manipulation