Robert-gyj's Stars
showlab/Show-o
Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.
jacob-zietek/awesome-world-models-manipulation
Awesome world models for manipulation
TencentARC/Open-MAGVIT2
Open-MAGVIT2: Democratizing Autoregressive Visual Generation
1x-technologies/1xgpt
world modeling challenge for humanoid robots
universome/stylegan-v
[CVPR 2022] StyleGAN-V: A Continuous Video Generator with the Price, Image Quality and Perks of StyleGAN2
bytedance/IRASim
alibaba/animate-anything
Fine-Grained Open Domain Image Animation with Motion Guidance
pixeli99/SVD_Xtend
Stable Video Diffusion Training Code and Extensions.
Stability-AI/generative-models
Generative Models by Stability AI
AILab-CVC/VideoCrafter
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
Doubiiu/DynamiCrafter
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
ChenHsing/Awesome-Video-Diffusion-Models
[CSUR] A Survey on Video Diffusion Models
maitrix-org/Pandora
Pandora: Towards General World Model with Natural Language Actions and Video States
jonbarron/website
intuitive-robots/mdt_policy
[RSS 2024] Code for "Multimodal Diffusion Transformer: Learning Versatile Behavior from Multimodal Goals" for CALVIN experiments with pre-trained weights
EDiRobotics/GR1-Training
A generalized policy for robotics manipulation
nickgkan/3d_diffuser_actor
Code for the paper "3D Diffuser Actor: Policy Diffusion with 3D Scene Representations"
JingyunLiang/SwinIR
SwinIR: Image Restoration Using Swin Transformer (official repository)
zsyOAOA/ResShift
ResShift: Efficient Diffusion Model for Image Super-resolution by Residual Shifting (NeurIPS@2023 Spotlight, TPAMI@2024)
joelibaceta/video-keyframe-detector
It is a simple python tool to extract key-frames from a video file using peak estimation from frame difference.
chuanyangjin/fast-DiT
Fast Diffusion Models with Transformers
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
facebookresearch/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
rail-berkeley/serl_franka_controllers
Cartesian impedance controller with reference limiting for Franka Emika Robot
rail-berkeley/serl
SERL: A Software Suite for Sample-Efficient Robotic Reinforcement Learning
octo-models/octo
Octo is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.
Yingdong-Hu/PVM-Robotics
The repository for a thorough empirical evaluation of pre-trained vision model performance across different downstream policy learning methods.
google-research/relay-policy-learning
Farama-Foundation/Gymnasium-Robotics
A collection of robotics simulation environments for reinforcement learning
Farama-Foundation/Gymnasium
An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)