zhangyan612's Stars
VITA-MLLM/VITA
✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction
THUDM/CogAgent
An open-sourced end-to-end VLM-based GUI Agent
Jessie940611/BAAIWorm
Genesis-Embodied-AI/Genesis
A generative world for general-purpose robotics & embodied AI learning.
blgpb/streaming-udp-video
a demo to transport video by UDP
trzy/robot-arm
Imitation learning with iPhone based teleoperation of a low-cost robot arm.
ob-f/OpenBot
OpenBot leverages smartphones as brains for low-cost robots. We have designed a small electric vehicle that costs about $50 and serves as a robot body. Our software stack for Android smartphones supports advanced robotics workloads such as person following and real-time autonomous navigation.
kscalelabs/onshape
K-Scale's library for programmatically interacting with OnShape
zeroth-robotics/zeroth-bot
3D-printed open-source humanoid robot platform for sim-to-real and RL
VITA-MLLM/Freeze-Omni
✨✨Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM
engineai-robotics/engineai_humanoid
jeffffffli/HybrIK
Official code of "HybrIK: A Hybrid Analytical-Neural Inverse Kinematics Solution for 3D Human Pose and Shape Estimation", CVPR 2021
huangwl18/ReKep
ReKep: Spatio-Temporal Reasoning of Relational Keypoint Constraints for Robotic Manipulation
engineai-robotics/engineai_legged_gym
pipecat-ai/rtvi-web-demo
Example UI implementing the RTVI web client
ex3ndr/llama-coder
Replace Copilot local AI
MervinPraison/PraisonAI
PraisonAI is an AI Agents Framework with Self Reflection. PraisonAI application combines PraisonAI Agents, AutoGen, and CrewAI into a low-code solution for building and managing multi-agent LLM systems, focusing on simplicity, customisation, and efficient human–agent collaboration.
openvla/openvla
OpenVLA: An open-source vision-language-action model for robotic manipulation.
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Improbable-AI/VisionProTeleop
VisionOS App + Python Library to stream head / wrist / finger tracking data from Vision Pro to any robots.
dexsuite/dex-retargeting
Dingry/BunnyVisionPro
Bimanual Dexterous Teleoperation with Real-Time Retargeting using VisionPro
MarkFzp/humanplus
[CoRL 2024] HumanPlus: Humanoid Shadowing and Imitation from Humans
2noise/ChatTTS
A generative speech model for daily dialogue.
TommyZihao/Mycobot_Tutorials
同济子豪兄大象机械臂Mycobot 280 Pi教程。机器人运动学、逆运动学、Python控制、ROS、具身智能。
gouldpa/Odd-Mech-Assemblies
livekit/agents
Build real-time multimodal AI applications 🤖🎙️📹
eureka-research/DrEureka
Official Repository for "DrEureka: Language Model Guided Sim-To-Real Transfer" (RSS 2024)
huggingface/lerobot
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
UT-Austin-RPL/TRILL
Official codebase for TRILL (Teleoperation and Imitation Learning for Loco-manipulation)