For benchmark policy evaluation on Atari. The policies are taken from pre-trained rainbow DQN agent from the Chiner RL model zoo [1].
A small test policy for Pong with sample code is available in test.py.
Link to full policies: Google drive.
You can cite the benchmark as:
@article{javed2023scalable,
title={Scalable real-time recurrent learning using columnar-constructive networks},
author={Javed, Khurram and Shah, Haseeb and Sutton, Richard S and White, Martha},
journal={Journal of Machine Learning Research},
volume={24},
pages={1--34},
year={2023}
}
[1] Fujita, Yasuhiro, et al. "Chainerrl: A deep reinforcement learning library." The Journal of Machine Learning Research 22.1 (2021): 3557-3570.