yangzhao-666

PhD student @ LIACS, and doing reinforcement learning.

LIACS, Leiden UniversityLeiden, the NL

Pinned Repositories

2m
Code for "Two-Memory Reinforcement Learning", COG 2023. A general framework to combine non-parametric episodic memory method and parametric deep reinforcement learning method.
Language:Python1 2 00
cec
Code for "Continuous Episodic Control", COG 2023. A non-parametric method for continuous control tasks.
2 1 00
common-mujoco-errors
This is a repository intends to summarise common errors people encounter while setting up mujoco_py experiments.
00
medal
Code to reproduce results for MEDAL in PyTorch. Also contains code for running SAC and FBRL.
Language:Python0 1 00
PbRSS
The code used for BNAIC 2021 paper "Potential-based Reward Shaping in Sokoban"
Language:Python2 2 01
Reinforcement-Learning
This is the assignment of RL, Leiden University
Language:Python0 1 00
sample-efficient-bayesian-rl
Source for the sample efficient tabular RL submission to the 2019 NIPS workshop on Biological and Artificial RL
Language:Jupyter Notebook0 1 00
TLCLS
The code used for BNAIC 2021 paper "Transfer Learning and Curriculum Learning in Sokoban"
Language:Python1 2 00
yangzhao-666.github.io
Personal webpage of Zhao Yang, originally forked from Kenneth Li
Language:HTML0 0 00
YoungChat
因毕业设计需要，计划使用mfc编写一款IM
Language:C++3 1 01

yangzhao-666/YoungChat
因毕业设计需要，计划使用mfc编写一款IM
Language:C++3 1 01
yangzhao-666/cec
Code for "Continuous Episodic Control", COG 2023. A non-parametric method for continuous control tasks.
2 1 00
yangzhao-666/PbRSS
The code used for BNAIC 2021 paper "Potential-based Reward Shaping in Sokoban"
Language:Python2 2 01
yangzhao-666/2m
Code for "Two-Memory Reinforcement Learning", COG 2023. A general framework to combine non-parametric episodic memory method and parametric deep reinforcement learning method.
Language:Python1 2 00
yangzhao-666/TLCLS
The code used for BNAIC 2021 paper "Transfer Learning and Curriculum Learning in Sokoban"
Language:Python1 2 00
yangzhao-666/common-mujoco-errors
This is a repository intends to summarise common errors people encounter while setting up mujoco_py experiments.
00
yangzhao-666/medal
Code to reproduce results for MEDAL in PyTorch. Also contains code for running SAC and FBRL.
Language:Python0 1 00
yangzhao-666/Reinforcement-Learning
This is the assignment of RL, Leiden University
Language:Python0 1 00
yangzhao-666/sample-efficient-bayesian-rl
Source for the sample efficient tabular RL submission to the 2019 NIPS workshop on Biological and Artificial RL
Language:Jupyter Notebook0 1 00
yangzhao-666/yangzhao-666.github.io
Personal webpage of Zhao Yang, originally forked from Kenneth Li
Language:HTML0 0 00