Could you please add the way to deal with rewards with same steps in a multi processes training?
qiuruiyu opened this issue · 2 comments
qiuruiyu commented
Xiong5Heng commented
Hi, I also meet the same challenge, have you solved it?
qiuruiyu commented
Hi, I also meet the same challenge, have you solved it?
No... I didn't manage to solve the problem because it seems that the curve looks no problem..