Distributional reinforcement learning algorithm using conjugated discrete distributions (C2D). Data and graphs for stochastic Atari 2600 environments.
Primary LanguagePython