gxywy/rl-plotter

Could you please add the way to deal with rewards with same steps in a multi processes training?

Opened this issue · 2 comments

In my training process, I use a multi-processes PPO.
When I want to draw the reward curve with rl-plotter, I found that:
image
Just like the image, there are rewards with the same steps. But it seems that it only shows one point in the curve?

Hi, I also meet the same challenge, have you solved it?

Hi, I also meet the same challenge, have you solved it?

No... I didn't manage to solve the problem because it seems that the curve looks no problem..