bytedance/GR-1

Question about the Random_shift_augmentation in training.

YTEP-ZHI opened this issue · 2 comments

Hi @bdrhtw, thanks for open-sourcing this great work. I have a question about the Random_shift_augmentation in training. Is this augmentation only applied during the video prediction stage, or is it applied in both the video prediction and policy learning stages? If it is also used for policy learning, will it cause a misalignment between the pre-recorded actions and the augmented visual observations? Looking forward to your reply.

Using random shift is common specifically for policy learning. You want to make your policy more robust against changes in the viewpoint. Although the shifted part of the image is different every time the relative position of the robot to the objects and the environment remains the same and thats the relevant part. This augmentation helps to prevent overfitting and improves your policy.

@mbreuss Thanks for your answer!