breez3young
PhD Student @ Tsinghua University, interested in RL, MARL, Embodied AI and World Models.
Tsinghua University
Pinned Repositories
breez3young
daydreamer
Variant of DayDreamer: World Models for Physical Robot Learning
diamond
DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.
iris
Transformers are Sample-Efficient World Models. ICLR 2023, notable top 5%.
minerl
MineRL Competition for Sample Efficient Reinforcement Learning - Python Package
Paper-List
record the waiting/done list of papers to read
perceiver-io
A PyTorch implementation of Perceiver, Perceiver IO and Perceiver AR with PyTorch Lightning scripts for distributed training
tdmpc
Code for "Temporal Difference Learning for Model Predictive Control"
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
breez3young's Repositories
breez3young/breez3young
breez3young/daydreamer
Variant of DayDreamer: World Models for Physical Robot Learning
breez3young/diamond
DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.
breez3young/iris
Transformers are Sample-Efficient World Models. ICLR 2023, notable top 5%.
breez3young/minerl
MineRL Competition for Sample Efficient Reinforcement Learning - Python Package
breez3young/Paper-List
record the waiting/done list of papers to read
breez3young/perceiver-io
A PyTorch implementation of Perceiver, Perceiver IO and Perceiver AR with PyTorch Lightning scripts for distributed training
breez3young/tdmpc
Code for "Temporal Difference Learning for Model Predictive Control"
breez3young/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.