Yingdong-Hu's Stars
2noise/ChatTTS
A generative speech model for daily dialogue.
karpathy/LLM101n
LLM101n: Let's build a Storyteller
huggingface/lerobot
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
FoundationVision/VAR
[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
tuna/thuthesis
LaTeX Thesis Template for Tsinghua University
karpathy/build-nanogpt
Video+code lecture on building nanoGPT from scratch
petercorke/robotics-toolbox-python
Robotics Toolbox for Python
stack-of-tasks/pinocchio
A fast and flexible implementation of Rigid Body Dynamics algorithms and their analytical derivatives
cambrian-mllm/cambrian
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
eloialonso/diamond
DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.
dora-rs/dora
DORA (Dataflow-Oriented Robotic Architecture) is middleware designed to streamline and simplify the creation of AI-based robotic applications. It offers low latency, composable, and distributed dataflow capabilities. Applications are modeled as directed graphs, also referred to as pipelines.
openvla/openvla
OpenVLA: An open-source vision-language-action model for robotic manipulation.
eureka-research/DrEureka
Official Repository for "DrEureka: Language Model Guided Sim-To-Real Transfer" (RSS 2024)
IDEA-Research/Grounding-DINO-1.5-API
Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series
OpenTeleVision/TeleVision
[CoRL 2024] Open-TeleVision: Teleoperation with Immersive Active Visual Feedback
robocasa/robocasa
RoboCasa: Large-Scale Simulation of Everyday Tasks for Generalist Robots
MarkFzp/humanplus
[CoRL 2024] HumanPlus: Humanoid Shadowing and Imitation from Humans
simpler-env/SimplerEnv
Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Google Robot, WidowX+Bridge) (CoRL 2024)
cmbruns/pyopenvr
Unofficial Python bindings for Valve's OpenVR virtual reality SDK
Dingry/BunnyVisionPro
Bimanual Dexterous Teleoperation with Real-Time Retargeting using VisionPro
vuer-ai/vuer
Vuer is a 3D visualization tool for robotics and VR applications.
nicklashansen/puppeteer
Code for "Hierarchical World Models as Visual Whole-Body Humanoid Controllers"
wuphilipp/gello_software
Aaditya-Prasad/consistency-policy
[RSS 2024] Consistency Policy: Accelerated Visuomotor Policies via Consistency Distillation
yjy0625/equibot
Official implementation of the paper "EquiBot: SIM(3)-Equivariant Diffusion Policy for Generalizable and Data Efficient Learning".
ToruOwO/hato
🕊️ HATO: Learning Visuotactile Skills with Two Multifingered Hands
rail-berkeley/oculus_reader
real-stanford/maniwav
Official codebase for the paper "ManiWAV: Learning Robot Manipulation from In-the-Wild Audio-Visual Data"
alexander-soare/consistency_policy
Distilling Diffusion Policy into consistency models
easonyang1996/THU-poster-gemini
Tsinghua-style poster template based on Gemini