Acmece/rl-collision-avoidance

convergence

Opened this issue · 0 comments

sorry to bother you, I want to know how many agents you used in stage_1 those trained in three PC?And my rewards are not convergent, how many eposides you used? Thanks a lot.