How to implement the "data-mixing only" ablation mentioned in the SVEA paper?

Question

Closed this issue 3 years ago · 5 comments

Congratulations on the acceptance to NeurIPS 2021!
I'd like to ask how to implement the "data-mixing only" ablation mentioned in the SVEA paper.

Answer 1 · 2021-11-12T18:33:33.000Z

Hi, are you referring to our "data-mixing only" ablation? For this ablation, we simply concatenate augmented and unaugmented observations along the batch dimension (for both s_t and s_t+1) after sampling. The other ablation (alpha=0, beta=1) is equivalent to only using augmented copies of s_t and only unaugmented copies for the Q-target.
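
To make the role of alpha and beta concrete, here is a minimal sketch of a weighted critic loss in which alpha weighs the term on unaugmented observations and beta weighs the term on augmented observations. The function name `weighted_critic_loss`, the single-output `critic` callable, and the precomputed `target_q` are assumptions for illustration, not the repository's API; the actual implementation may differ (for example, by using two Q-heads).

```python
import torch
import torch.nn.functional as F


def weighted_critic_loss(critic, obs, obs_aug, action, target_q, alpha, beta):
    """Hypothetical helper: alpha weighs the unaugmented term, beta the augmented one.

    target_q is assumed to be computed by the caller from unaugmented s_{t+1}.
    """
    q_clean = critic(obs, action)    # Q-values on unaugmented s_t
    q_aug = critic(obs_aug, action)  # Q-values on augmented s_t
    return alpha * F.mse_loss(q_clean, target_q) + beta * F.mse_loss(q_aug, target_q)
```

With alpha=0 and beta=1 only the augmented term contributes, which matches the description of the second ablation above.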

Answer 2 · 2021-11-14T14:15:21.000Z

Thanks!
Do you mean that s_t+1 is also augmented in the "data-mixing only" ablation?
And what are alpha and beta for the "data-mixing only" ablation?

Answer 3 · 2021-11-16T05:26:33.000Z

Correct. Our two ablations each remove one of the two components of SVEA (data-mixing and unaugmented Q-targets). We implemented the "data-mixing only" baseline using the same hyperparameters as for our proposed method (i.e., alpha=0.5, beta=0.5). Translated into code, following the code style in https://github.com/nicklashansen/dmcontrol-generalization-benchmark/blob/main/src/algorithms/svea.py, it would be something like this:

    obs = utils.cat(obs, augmentations.random_conv(obs.clone()))
    next_obs = utils.cat(next_obs, augmentations.random_conv(next_obs.clone()))
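
The snippet above only covers the observations. Once both obs and next_obs are doubled along the batch dimension, the remaining batch elements have to be duplicated as well so that shapes stay aligned. A self-contained sketch of that bookkeeping follows; `mix_batch` and `noise_augment` are hypothetical names, `noise_augment` is a simple stand-in for `augmentations.random_conv`, and `torch.cat(..., dim=0)` is assumed to behave like the repository's `utils.cat`.

```python
import torch


def noise_augment(x):
    """Hypothetical stand-in for augmentations.random_conv: adds Gaussian noise."""
    return x + 0.1 * torch.randn_like(x)


def mix_batch(obs, action, reward, next_obs, not_done, augment):
    """Sketch of the "data-mixing only" batch: both s_t and s_{t+1} are mixed."""
    # Concatenate unaugmented and augmented copies along the batch dimension.
    obs = torch.cat([obs, augment(obs.clone())], dim=0)
    next_obs = torch.cat([next_obs, augment(next_obs.clone())], dim=0)
    # Duplicate the other elements so every row still lines up with its transition.
    action = torch.cat([action, action], dim=0)
    reward = torch.cat([reward, reward], dim=0)
    not_done = torch.cat([not_done, not_done], dim=0)
    return obs, action, reward, next_obs, not_done
```

Under these assumptions, the first half of each returned tensor is the original batch and the second half is its augmented counterpart, so Q-targets computed from the returned next_obs would use both unaugmented and augmented s_t+1.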

Answer 4 · 2021-11-18T04:10:00.000Z

Thanks a lot!

Answer 5 · 2021-11-22T17:36:54.000Z

Happy to help! I am closing this issue; feel free to open a new one if you have further questions.