nicklashansen/dmcontrol-generalization-benchmark

How to implement data-mixing only mentioned in the SVEA paper?

Closed this issue · 5 comments

Congratulation to be accepted by NIPS 2021!!
I wanna ask how to implement data-mixing only mentioned in the SVEA paper?

Hi, are you referring to our "data-mixing only" ablation? For this ablation, we simply concatenate augmented and unaugmented observations along the batch dimension (for both s_t and s_t+1) after sampling. The other ablation (alpha=0, beta=1) is equivalent to only using augmented copies of s_t and only unaugmented copies for the Q-target.

Thanks!
Do you mean that the s_t+1 is also augmented for "data-mixing only" ablation?
And what's alpha and beta for "data-mixing only" ablation?

Correct. Our two ablations ablate each of the two components of SVEA (data-mixing and unaugmented Q-targets). We implemented the "data-mixing only" baseline using the same hyperparameters as for our proposed method (i.e., alpha=0.5, beta=0.5). Translated into code it would be something like this: obs = utils.cat(obs, augmentations.random_conv(obs.clone())); next_obs = utils.cat(next_obs, augmentations.random_conv(next_obs.clone())) following the code style in https://github.com/nicklashansen/dmcontrol-generalization-benchmark/blob/main/src/algorithms/svea.py.

Thanks a lot !

Happy to help! I am closing this issue; feel free to open a new issue if you have further questions