A mini library for Policy Gradients with Parameter-based Exploration, with reference implementation of the ClipUp optimizer (https://arxiv.org/abs/2008.02387) from NNAISENSE.
Primary LanguagePythonBSD 3-Clause "New" or "Revised" LicenseBSD-3-Clause
No issues in this repository yet.