Code for the paper "Discount Factor as a Regularizer in Reinforcement Learning", Ron Amit, Ron Meir, Kamil Ciosek, ICML 2020. [Paper] [Slides] [Video] [bib]
DDPG and TD3 code is based on: https://github.com/sfujim/TD3
- Python 3.7
- NumPy, Matplotlib, and seaborn
- Ray 0.84
All results are saved in this zip file (including complete parameters, raw results, and figures).