Pinned Repositories
asyua_baseline
Replications of several RL algorithms, organized by discrete and continuous environments.
DRL
Deep RL: offline, offline-to-online, and other methods; some paper reproductions; notes on what I learn and my experience.
github-slideshow
A robot-powered training repository 🤖
marios-PPO
PPO for Mario, in PyTorch
Super-Mario-RL
🍄 Reinforcement Learning: Super Mario Bros with Dueling DQN 🍄
PolicyGradientsJax
On-Policy Policy Gradient Algorithms in JAX
SO2
[AAAI2024] A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning
asyua-ye's Repositories
asyua-ye/marios-PPO
PPO for Mario, in PyTorch
asyua-ye/asyua_baseline
Replications of several RL algorithms, organized by discrete and continuous environments.
asyua-ye/DRL
Deep RL: offline, offline-to-online, and other methods; some paper reproductions; notes on what I learn and my experience.
asyua-ye/github-slideshow
A robot-powered training repository 🤖