google-research/ravens

Question: batch_size=1 for Transporter models?

MohitShridhar opened this issue · 4 comments

Thanks for open-sourcing this great work!

I noticed that all the Transporter models are trained with a batch size of 1. Is there anything preventing the use of larger batches other than memory?

Thanks Mohit! Other than memory, there shouldn't be anything preventing the use of larger batches. Note that the input rotations expand the batch size to the number of rotations (default 36). So a batch size of 2 would be expanded to 2 * n_rotations (72 with the default). To support this, you may need to make a few changes to the data preprocessing and loss function.
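For anyone following along, here is a minimal shape-only sketch of that expansion, assuming a `(B, H, W, C)` batch layout. The helper `expand_with_rotations` is hypothetical and only replicates samples to show how the effective batch size grows; the actual Transporter code also rotates the inputs per copy.

```python
import numpy as np

def expand_with_rotations(batch, n_rotations=36):
    """Tile a batch so each sample appears once per rotation angle.

    Shape-only sketch: the real pipeline would also apply a distinct
    rotation to each copy. Here we just repeat samples to illustrate
    how a batch of size B becomes B * n_rotations.
    """
    # batch: (B, H, W, C) -> (B * n_rotations, H, W, C)
    return np.repeat(batch, n_rotations, axis=0)

batch = np.zeros((2, 64, 64, 6))         # batch_size = 2
expanded = expand_with_rotations(batch)  # effective batch = 2 * 36 = 72
print(expanded.shape)  # (72, 64, 64, 6)
```

Any loss that assumes one label per sample would then need to account for the extra rotation dimension.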

Awesome, thanks!

Hi Andy, sorry, just a quick follow-up: for the evaluations in Table 2 of the paper, were the agents trained for a fixed number of iterations (1k? 10k?). Or did you pick the agents with the best validation loss for each run?

Hi Mohit, I believe we trained the agents for multiple fixed numbers of iterations (1K, 2K, 5K, 10K, 20K, 40K), then picked the iteration count with the best average validation loss across seeds.