xiaowei2013-2026

JiLin University

Pinned Repositories

baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
Language:Python00
BCQ
Author's PyTorch implementation of BCQ for continuous and discrete actions
Language:Python00
BEAR
Code for Stabilizing Off-Policy RL via Bootstrapping Error Reduction
Language:Python00
cped
The code implementation of paper "Constrained Policy Optimization with Explicit Behavior Density for Offline Reinforcement Learning"
Language:Python00
free-programming-books-zh_CN
:books: 免费的计算机编程类中文书籍，欢迎投稿
00
HowToCook
程序员在家做饭方法指南。Programmer's guide about how to cook at home (Simplified Chinese only).
Language:Shell00
iql-pytorch
Unofficial PyTorch implementation (replicating paper results) of Implicit Q-Learning (In-sample Q-Learning) for offline RL
Language:Python00
IVR
[ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization"
Language:Python00
linux-command
Linux命令大全搜索工具，内容包含Linux命令手册、详解、学习、搜集。https://git.io/linux
Language:Markdown00
MCQ
Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022)
Language:Python00

xiaowei2013-2026's Repositories

xiaowei2013-2026/linux-command
Linux命令大全搜索工具，内容包含Linux命令手册、详解、学习、搜集。https://git.io/linux
xiaowei2013-2026/HowToCook
程序员在家做饭方法指南。Programmer's guide about how to cook at home (Simplified Chinese only).
xiaowei2013-2026/cped
The code implementation of paper "Constrained Policy Optimization with Explicit Behavior Density for Offline Reinforcement Learning"
xiaowei2013-2026/free-programming-books-zh_CN
:books: 免费的计算机编程类中文书籍，欢迎投稿
xiaowei2013-2026/baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
xiaowei2013-2026/SVR
Code for Supported Value Regularization for Offline Reinforcement Learning
xiaowei2013-2026/PRDC
Author's PyTorch implementation of ICML'23 paper "Policy Regularization with Dataset Constraint for Offline Reinforcement Learning" for D4RL gym and AntMaze tasks.
xiaowei2013-2026/OEMA
Official PyTorch code for "Sample Efficient Offline-to-Online Reinforcement Learning" in TKDE'23.
xiaowei2013-2026/IVR
[ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization"
xiaowei2013-2026/TD3
Author's PyTorch implementation of TD3 for OpenAI gym tasks
xiaowei2013-2026/MCQ
Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022)
xiaowei2013-2026/iql-pytorch
Unofficial PyTorch implementation (replicating paper results) of Implicit Q-Learning (In-sample Q-Learning) for offline RL
xiaowei2013-2026/SPOT
Code release for "Supported Policy Optimization for Offline Reinforcement Learning" (NeurIPS 2022), https://arxiv.org/abs/2202.06239
xiaowei2013-2026/wPC
Implementation for " weighted policy constraints for offline reinforcement learning"
xiaowei2013-2026/SBAC
Facebear's minimal implementation of SBAC (Soft behavior regularized actor critic, NIPS22 offline RL workshop)
xiaowei2013-2026/TD3_BC
Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL
xiaowei2013-2026/SAC
PyTorch implementation of Soft Actor-Critic (SAC)
xiaowei2013-2026/BCQ
Author's PyTorch implementation of BCQ for continuous and discrete actions
xiaowei2013-2026/BEAR
Code for Stabilizing Off-Policy RL via Bootstrapping Error Reduction
xiaowei2013-2026/thu-cst-cracker
清华大学计算机系课程攻略

xiaowei2013-2026

Pinned Repositories

baselines

BCQ

BEAR

cped

free-programming-books-zh_CN

HowToCook

iql-pytorch

IVR

linux-command

MCQ

xiaowei2013-2026's Repositories

xiaowei2013-2026/linux-command

xiaowei2013-2026/HowToCook

xiaowei2013-2026/cped

xiaowei2013-2026/free-programming-books-zh_CN

xiaowei2013-2026/baselines

xiaowei2013-2026/SVR

xiaowei2013-2026/PRDC

xiaowei2013-2026/OEMA

xiaowei2013-2026/IVR

xiaowei2013-2026/TD3

xiaowei2013-2026/MCQ

xiaowei2013-2026/iql-pytorch

xiaowei2013-2026/SPOT

xiaowei2013-2026/wPC

xiaowei2013-2026/SBAC

xiaowei2013-2026/TD3_BC

xiaowei2013-2026/SAC

xiaowei2013-2026/BCQ

xiaowei2013-2026/BEAR

xiaowei2013-2026/thu-cst-cracker