/acme_r2d2_policy

A thin wrapper around the R2D2 algorithm as implemented in DeepMind's Acme framework

Primary LanguagePython

Watchers