Pinned Repositories
BEAR
PyTorch implementation of BEAR in "Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction"
CPQ
[AAAI 2022] The official implementation of CPQ in "Constraints Penalized Q-learning for Safe Offline Reinforcement Learning"
CQL
Implementation of CQL in "Conservative Q-Learning for Offline Reinforcement Learning", based on the BRAC family.
DeepThermal
[AAAI 2022] The official implementation of "DeepThermal: Combustion Optimization for Thermal Power Generating Units Using Offline Reinforcement Learning"
Discrete_IVR
Discrete version of SQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization"
DWBC
[ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"
Fisher_BRC
Implementation of Fisher_BRC in "Offline Reinforcement Learning with Fisher Divergence Critic Regularization", based on the BRAC family.
IVR
[ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization"
POR
[NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"
ryanxhr.github.io
🐟 A simple theme for Jekyll. Live at https://eliottvincent.github.io/bay/
ryanxhr's Repositories
ryanxhr/POR
[NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"
ryanxhr/IVR
[ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization"
ryanxhr/DWBC
[ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"
ryanxhr/DeepThermal
[AAAI 2022] The official implementation of "DeepThermal: Combustion Optimization for Thermal Power Generating Units Using Offline Reinforcement Learning"
ryanxhr/CPQ
[AAAI 2022] The official implementation of CPQ in "Constraints Penalized Q-learning for Safe Offline Reinforcement Learning"
ryanxhr/BEAR
PyTorch implementation of BEAR in "Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction"
ryanxhr/CQL
Implementation of CQL in "Conservative Q-Learning for Offline Reinforcement Learning", based on the BRAC family.
ryanxhr/Discrete_IVR
Discrete version of SQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization"
ryanxhr/Fisher_BRC
Implementation of Fisher_BRC in "Offline Reinforcement Learning with Fisher Divergence Critic Regularization", based on the BRAC family.
ryanxhr/ryanxhr.github.io
🐟 A simple theme for Jekyll. Live at https://eliottvincent.github.io/bay/