MarcoMeter/recurrent-ppo-truncated-bptt

Baseline implementation of recurrent PPO using truncated BPTT

Jupyter NotebookMIT

Issues

Question Regarding Sequence Length
#17 opened 19 days ago by Davide236
3
Pre-trained Models Do Not Work
#14 opened a year ago by WilliamYue37
7
How to fix the problem with "Segmentation fault (core dumped)"
#13 opened 2 years ago by jiashuncheng
0
Excuse me，how enjoy the model “./models/cartpole_masked.nn”？ When I run enjoy.py ， Show "RuntimeError: expected scalar type Double but found Float "
#9 opened 2 years ago by jialuyu61
4
Can this repo train continuous environments?
#8 opened 2 years ago by 1900360
7
Masked mean for advantage normalization?
#10 opened 2 years ago by finnBsch
8
about sequence_length
#11 opened 2 years ago by xixiha5230
2
Possibility to reference the implementation
#6 opened 2 years ago by RobvanGastel
6
Suggestions for training on multiple environments simultaneously?
#7 opened 2 years ago by fedshyvana
3
Calculation of the Generalized Advantage Estimation
#5 opened 3 years ago by RobvanGastel
5
Adapting the repo to my specific problem
#3 opened 3 years ago by VVIERV00
8