jonyzhang2023

Embodied AI Researcher

Hong Kong University of Science and Technology

jonyzhang2023's Stars

deepseek-ai/DeepSeek-V3
Language: Python 18.9k 148 1181.5k
deepseek-ai/DeepSeek-Coder
DeepSeek Coder: Let the Code Write Itself
Language: Python 10.1k 82 166669
NVIDIA/Cosmos
Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. Cosmos is purpose built for physical AI. The Cosmos repository will enable end users to run the Cosmos models, run inference scripts and generate videos.
Language: Python 6.7k407
FoundationVision/VAR
[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
Language: Jupyter Notebook 6.7k 120 109447
NVlabs/VILA
VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.
Language: Python 2.7k 39 154217
atong01/conditional-flow-matching
TorchCFM: a Conditional Flow Matching library
Language: Python 1.4k 18 59119
OpenDriveLab/AgiBot-World
World's First Large-scale High-quality Robotic Manipulation Benchmark
Language: Python 1.2k 18 379
zchoi/Awesome-Embodied-Agent-with-LLMs
This is a curated list of "Embodied AI or robot with Large Language Models" research. Watch this repository for the latest updates! 🔥
1.1k 45 362
orangeduck/Motion-Matching
Learned Motion Matching example implementation and source code for the article "Code vs Data Driven Displacement"
Language: C++ 727 28 31106
facebookresearch/metamotivo
The first behavioral foundation model to control a virtual physics-based humanoid agent for a wide range of whole-body tasks.
Language: Python 513 12 642
alexsax/2D-3D-Semantics
The data skeleton from Joint 2D-3D-Semantic Data for Indoor Scene Understanding
Language: C++ 477 13 4667
allenzren/open-pi-zero
Re-implementation of pi0 vision-language-action (VLA) model from Physical Intelligence
Language: Python 469 8 1025
UMass-Foundation-Model/3D-VLA
[ICML 2024] 3D-VLA: A 3D Vision-Language-Action Generative World Model
Language: Python 399 17 814
geng-haoran/Simulately
A universal summary of current robotics simulators
Language: TypeScript 360 12 519
Yzichen/FlashOCC
Language: Python 342 2 9537
vision-x-nyu/thinking-in-space
Official repo and evaluation implementation of VSI-Bench
Language: Python 321 4 521
RL4VLM/RL4VLM
Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning
Language: Jupyter Notebook 249 6 2819
xuxw98/ESAM
EmbodiedSAM: Online Segment Any 3D Thing in Real Time
Language: Python 248 4 3715
Robot-VLAs/RoboVLMs
Language: Python 230 5 76
mbodiai/embodied-agents
Seamlessly integrate state-of-the-art transformer models into robotics stacks
Language: Python 176 5 1521
mlzxy/arp
Autoregressive Policy for Robot Learning
Language: Python 97 4 146
HRI-EU/flow_matching
Affordance-based Robot Manipulation with Flow Matching
Language: Shell 96 3 08
jamycheung/360BEV
Repository of 360BEV
Language: Python 91 3 85
LiewFeng/RayDN
[ECCV 2024] Ray Denoising (RayDN): Depth-aware Hard Negative Sampling for Multi-view 3D Object Detection
Language: Python 91 2 181
Stanford-ILIAD/openvla-mini
OpenVLA: An open-source vision-language-action model for robotic manipulation.
Language: Python 87 1 06
facebookresearch/humenv
HumEnv is an SMPL humanoid environment enabling systematic model comparison and reproducibility
Language: Python 78 7 35
ir-lab/bimanual-imitation
Code for paper, "A Comparison of Imitation Learning Algorithms for Bimanual Manipulation" (Drolet et al., 2024)
Language: Python 71 2 53
TEA-Lab/Robo-ABC
[ECCV 2024] 🎉 Official repository of "Robo-ABC: Affordance Generalization Beyond Categories via Semantic Correspondence for Robot Manipulation"
Language: Python 63 7 31
yuxuanxienova/UnityTerrainConvertor
Language: C# 150
MyRepositories-hub/DML-RL
Language:Python2
