1349949's Stars

meta-llama/llama3
The official Meta Llama 3 GitHub site
Language: Python 27.7k 230 2733.2k
LargeWorldModel/LWM
Large World Model -- Modeling Text and Video with Millions Context
Language: Python 7.2k 66 71554
DepthAnything/Depth-Anything-V2
[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
Language: Python 4.2k 35 212365
isaac-sim/IsaacLab
Unified framework for robot learning built on NVIDIA Isaac Sim
Language: Python 2.5k 36 8991.1k
NVlabs/VILA
VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
Language: Python 2.2k 37 144184
baaivision/Emu3
Next-Token Prediction is All You Need
Language: Python 1.9k 32 5477
openvla/openvla
OpenVLA: An open-source vision-language-action model for robotic manipulation.
Language: Python 1.6k 22 176208
openreasoner/openr
OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models
Language: Python 1.3k 7 68109
opendilab/LightZero
[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)
Language: Python 1.2k 12 112129
DAMO-NLP-SG/VideoLLaMA2
VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs
Language: Python 981 10 11965
kakaobrain/rq-vae-transformer
The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22)
Language: Jupyter Notebook 813 16 2390
leggedrobotics/rsl_rl
Fast and simple implementation of RL algorithms, designed to run fully on GPU.
Language: Python 755 33 29206
mlfoundations/datacomp
DataComp: In search of the next generation of multimodal datasets
Language: Python 669 17 6457
unitreerobotics/unitree_ros
Language: C++ 658 22 98276
OpenDriveLab/Vista
[NeurIPS 2024] A Generalizable World Model for Autonomous Driving
Language: Python 620 18 4746
robocasa/robocasa
RoboCasa: Large-Scale Simulation of Everyday Tasks for Generalist Robots
Language: Python 618 11 10455
huangwl18/ReKep
ReKep: Spatio-Temporal Reasoning of Relational Keypoint Constraints for Robotic Manipulation
Language: Python 586 9 3263
lucidrains/magvit2-pytorch
Implementation of MagViT2 Tokenizer in Pytorch
Language: Python 578 27 3533
NVlabs/EAGLE
EAGLE: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders
Language: Python 559 31 2245
OpenRobotLab/GRUtopia
GRUtopia: Dream General Robots in a City at Scale
Language: Python 541 12 2228
1x-technologies/1xgpt
world modeling challenge for humanoid robots
Language: Python 394 18 1532
NVlabs/ProtoMotions
Language: Python 392 13 3629
mira-space/MiraData
Official repo for paper "MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions"
Language: Python 375 13 1910
LeCAR-Lab/human2humanoid
[IROS 2024] Learning Human-to-Humanoid Real-Time Whole-Body Teleoperation. [CoRL 2024] OmniH2O: Universal and Dexterous Human-to-Humanoid Whole-Body Teleoperation and Learning
Language: Python 319 5 3117
tsb0601/MMVP
Language: Python 297 10 277
Beckschen/ViTamin
[CVPR 2024] Official implementation of "ViTamin: Designing Scalable Vision Models in the Vision-language Era"
Language: Python 188 6 116
microsoft/MoCapAct
A Multi-Task Dataset for Simulated Humanoid Control
Language: Python 172 12 1022
wang-fujin/PINN4SOH
A physics-informed neural network for battery SOH estimation
Language: Python 150 4 326
TencentARC/ST-LLM
[ECCV 2024🔥] Official implementation of the paper "ST-LLM: Large Language Models Are Effective Temporal Learners"
Language: Python 134 7 214
vivym/OmniGen
Language: Python 6 1 0 0