ajlangley/cpo-pytorch

An implementation of Constrained Policy Optimization (Achiam et al., 2017) in PyTorch

Python

Issues

Mean KL is always 0
#10 opened 2 years ago by xzhang2523
5
"from envs.ant_gather import AntGatherEnv"
#7 opened 2 years ago by xzhang2523
1
A "bug" in the CPO method?
#8 opened 2 years ago by xzhang2523
1
Line 2 leads to imp_sampling=1
#9 opened 2 years ago by xzhang2523
1
mj_loadXML error: b'Error: engine error: Could not allocate memory'
#5 opened 3 years ago by lwyncepu
3
Does it converge?
#6 opened 3 years ago by Bigpig4396
9
Where can I find the AntGather env?
#4 opened 3 years ago by DZ9
1
Nice work
#1 opened 4 years ago by xiaoyuanzh
1
Some questions about the code
#2 opened 4 years ago by Baiyu6666
1
[question] How to turn my custom environment into an environment suitable for CPO?
#3 opened 4 years ago by kosmylo
1