/c2d

Distributional reinforcement learning algorithm using conjugated discrete distributions (C2D). Data and graphs for stochastic Atari 2600 environments.

Primary LanguagePython

Stargazers