Pinned Repositories
baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
BCQ
Author's PyTorch implementation of BCQ for continuous and discrete actions
BEAR
Code for Stabilizing Off-Policy RL via Bootstrapping Error Reduction
cped
The code implementation of paper "Constrained Policy Optimization with Explicit Behavior Density for Offline Reinforcement Learning"
free-programming-books-zh_CN
:books: 免费的计算机编程类中文书籍,欢迎投稿
HowToCook
程序员在家做饭方法指南。Programmer's guide about how to cook at home (Simplified Chinese only).
iql-pytorch
Unofficial PyTorch implementation (replicating paper results) of Implicit Q-Learning (In-sample Q-Learning) for offline RL
IVR
[ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization"
linux-command
Linux命令大全搜索工具,内容包含Linux命令手册、详解、学习、搜集。https://git.io/linux
MCQ
Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022)
xiaowei2013-2026's Repositories
xiaowei2013-2026/linux-command
Linux命令大全搜索工具,内容包含Linux命令手册、详解、学习、搜集。https://git.io/linux
xiaowei2013-2026/HowToCook
程序员在家做饭方法指南。Programmer's guide about how to cook at home (Simplified Chinese only).
xiaowei2013-2026/cped
The code implementation of paper "Constrained Policy Optimization with Explicit Behavior Density for Offline Reinforcement Learning"
xiaowei2013-2026/free-programming-books-zh_CN
:books: 免费的计算机编程类中文书籍,欢迎投稿
xiaowei2013-2026/baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
xiaowei2013-2026/SVR
Code for Supported Value Regularization for Offline Reinforcement Learning
xiaowei2013-2026/PRDC
Author's PyTorch implementation of ICML'23 paper "Policy Regularization with Dataset Constraint for Offline Reinforcement Learning" for D4RL gym and AntMaze tasks.
xiaowei2013-2026/OEMA
Official PyTorch code for "Sample Efficient Offline-to-Online Reinforcement Learning" in TKDE'23.
xiaowei2013-2026/IVR
[ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization"
xiaowei2013-2026/TD3
Author's PyTorch implementation of TD3 for OpenAI gym tasks
xiaowei2013-2026/MCQ
Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022)
xiaowei2013-2026/iql-pytorch
Unofficial PyTorch implementation (replicating paper results) of Implicit Q-Learning (In-sample Q-Learning) for offline RL
xiaowei2013-2026/SPOT
Code release for "Supported Policy Optimization for Offline Reinforcement Learning" (NeurIPS 2022), https://arxiv.org/abs/2202.06239
xiaowei2013-2026/wPC
Implementation for " weighted policy constraints for offline reinforcement learning"
xiaowei2013-2026/SBAC
Facebear's minimal implementation of SBAC (Soft behavior regularized actor critic, NIPS22 offline RL workshop)
xiaowei2013-2026/TD3_BC
Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL
xiaowei2013-2026/SAC
PyTorch implementation of Soft Actor-Critic (SAC)
xiaowei2013-2026/BCQ
Author's PyTorch implementation of BCQ for continuous and discrete actions
xiaowei2013-2026/BEAR
Code for Stabilizing Off-Policy RL via Bootstrapping Error Reduction
xiaowei2013-2026/thu-cst-cracker
清华大学计算机系课程攻略