ryanxhr

Deep Reinforcement Learning

UT Austin

Pinned Repositories

BEAR
PyTorch implementation of BEAR in "Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction"
Language: Python · 10 2 01
CPQ
[AAAI 2022] The official implementation of CPQ in "Constraints Penalized Q-learning for Safe Offline Reinforcement Learning"
Language: Python · 11 2 12
CQL
Implementation of CQL in "Conservative Q-Learning for Offline Reinforcement Learning" based on BRAC family.
Language: Python · 7 1 00
DeepThermal
[AAAI 2022] The official implementation of "DeepThermal: Combustion Optimization for Thermal Power Generating Units Using Offline Reinforcement Learning"
Language: Python · 13 1 12
Discrete_IVR
Discrete version of SQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization"
Language: Python · 3 2 02
DWBC
[ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"
Language: Python · 34 1 12
Fisher_BRC
Implementation of Fisher_BRC in "Offline Reinforcement Learning with Fisher Divergence Critic Regularization" based on BRAC family.
Language: Python · 1 1 01
IVR
[ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization"
Language: Python · 44 2 36
POR
[NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"
Language: Python · 55 3 27
ryanxhr.github.io
🐟 A simple theme for Jekyll. Live at https://eliottvincent.github.io/bay/
Language: JavaScript · 0 0 02

