Acmece/rl-collision-avoidance

Query on stage1_1.pth

Opened this issue · 1 comments

Hello,

Could you please explain the difference between loading the policy of stage1_1.pth and no policy at all? Is the stage1_1.pth some form of supervised learning?

Hi, the model in the policy fold is what I have trained.
If you want to train from the scratch, you may run the following to trian the stage 1:

rosrun stage_ros_add_pose_and_crash stageros -g worlds/stage1.world
mpiexec -np 24 python ppo_stage1.py

then modify this line to load the policy of stage 1 and train stage2 afterwards:

rosrun stage_ros_add_pose_and_crash stageros -g worlds/stage2.world
mpiexec -np 44 python ppo_stage2.py