sweetice/Deep-reinforcement-learning-with-pytorch

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

Python · MIT

Issues

program error in gridworld.py
#46 opened a month ago by Wei-yao-Cheng
0
Bugs in PPO
#6 opened 5 years ago by moonblue333
6
wrong code in SAC
#43 opened a year ago by QinwenLuo
1
Error while seeding!
#45 opened a year ago by CajetanRodrigues
1
A problem in Chapter 5: DDPG
#17 opened 4 years ago by MoonieC
2
Big bug in PPO2
#35 opened 2 years ago by Vinson-sheep
3
One parameter is missing in DDPG Code
#32 opened a year ago by catchy666
1
SAC_Bug
#38 opened 2 years ago by aut6620
3
SAC Bugs
#25 opened 3 years ago by ZiyiLiubird
3
bug in reinforce with baseline
#37 opened 2 years ago by hlhang9527
2
Temperature factor missing in SAC !!!
#36 opened 2 years ago by Darkness-hy
1
what is your training performance on mujoco?
#33 opened 2 years ago by xiaoyuanzh
0
About PPO
#24 opened 3 years ago by LpLegend
3
about the advantage values in PPO2
#30 opened 3 years ago by Hardlygo
0
I don't think PPO pendulum is converging
#7 opened 5 years ago by Bigpig4396
4
question to author
#28 opened 3 years ago by FulChou
0
Are there TRPO implementations?
#27 opened 3 years ago by ChenDRAG
0
About updating.
#26 opened 3 years ago by Michi-123
0
Some of the code was copied from Morvan
#23 opened 3 years ago by xiaoxuh
0
About SAC's version bug (gym 0.17.3)
#21 opened 4 years ago by CoderAT13
1
Cannot solve the Pendulum problem with the PPO implementation in Chapter 07
#19 opened 4 years ago by ryanhuang1014
0
Problem about DDPG
#18 opened 4 years ago by yikeqingli
0
a bug in DQN.py
#15 opened 4 years ago by karlhjm
0
about TRPO
#12 opened 5 years ago by wangyy161
0
Confused about different action_sample way in SAC
#11 opened 5 years ago by ocean1211
0
A small suggestion
#10 opened 5 years ago by sunnyswag
0
Why is tensorboardX's abscissa not accurate?
#9 opened 5 years ago by xiaoxuh
0
Possible issue with policy decay in TD3.
#8 opened 5 years ago by WillBrennan
1
Char 05 DDPG missing exploration noise
#3 opened 5 years ago by tomas-gajarsky
1
Char 05 DDPG: step index and episode index
#5 opened 5 years ago by xiangzz
1
confused about the calculation of R in PPO
#4 opened 5 years ago by LiuShangYuan
0
Char05 DDPG missing script modules for imports
#2 opened 5 years ago by tomas-gajarsky
1