ZifanWu

Pinned Repositories

CAL
Code accompanying the paper "Off-Policy Primal-Dual Safe Reinforcement Learning"
Language: Python 12 3 24
Coordinated-PPO
Code accompanying paper "Coordinated Proximal Policy Optimization"
Language: Python 11 0 13
dopamine
Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.
Language: Jupyter Notebook 00
DrQ-v2_in_Jax
Language: Jupyter Notebook 10
fitting-random-labels
Example code for the paper "Understanding deep learning requires rethinking generalization"
Language: Python 00
Jax_HL_Gauss_loss_on_DMControl
Jax implementation of HL-Gauss loss (from the paper "Stop Regressing: Training Value Functions via Classification for Scalable Deep RL") on top of DRL algorithms.
Language: Jupyter Notebook 0 1 00
jaxpruner
Language: Python 00
MAG
Code accompanying paper "Models as Agents: Optimizing Multi-Step Predictions of Interactive Local Models in Model-Based Multi-Agent Reinforcement Learning".
Language: Python 14 2 12
mbpo_pytorch
A PyTorch replication of the model-based reinforcement learning algorithm MBPO
Language: Python 0 0 00
Plan-to-Predict
Code accompanying paper "Plan To Predict: Learning an Uncertainty-Foreseeing Model for Model-Based Reinforcement Learning".
Language: Python 8 2 05

ZifanWu's Repositories

ZifanWu/MAG
Code accompanying paper "Models as Agents: Optimizing Multi-Step Predictions of Interactive Local Models in Model-Based Multi-Agent Reinforcement Learning".
Language: Python 14 2 12
ZifanWu/CAL
Code accompanying the paper "Off-Policy Primal-Dual Safe Reinforcement Learning"
Language: Python 12 3 24
ZifanWu/Coordinated-PPO
Code accompanying paper "Coordinated Proximal Policy Optimization"
Language: Python 11 0 13
ZifanWu/Plan-to-Predict
Code accompanying paper "Plan To Predict: Learning an Uncertainty-Foreseeing Model for Model-Based Reinforcement Learning".
Language: Python 8 2 05
ZifanWu/DrQ-v2_in_Jax
Language: Jupyter Notebook 10
ZifanWu/dopamine
Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.
Language: Jupyter Notebook 00
ZifanWu/fitting-random-labels
Example code for the paper "Understanding deep learning requires rethinking generalization"
Language: Python 00
ZifanWu/Jax_HL_Gauss_loss_on_DMControl
Jax implementation of HL-Gauss loss (from the paper "Stop Regressing: Training Value Functions via Classification for Scalable Deep RL") on top of DRL algorithms.
Language: Jupyter Notebook 0 1 00
ZifanWu/jaxpruner
Language: Python 00
ZifanWu/mbpo_pytorch
A PyTorch replication of the model-based reinforcement learning algorithm MBPO
Language: Python 0 0 00
ZifanWu/mbrl-lib
Library for Model Based RL
Language: Python 0 0 00
ZifanWu/omnisafe
OmniSafe is an infrastructural framework for accelerating SafeRL research.
Language: Python 0 0 00
ZifanWu/pytorch-mopo
Re-implementation of the offline model-based RL algorithm MOPO in PyTorch
Language: Python 0 0 00
ZifanWu/SAC-Lagrangian
PyTorch implementation of Constrained Reinforcement Learning for Soft Actor Critic Algorithm
Language: Python 0 0 00
ZifanWu/Safe-RL
Language: Python 0 0 00