An actor-critic approach to solving LQR with only a single trajectory.
Primary LanguagePythonMIT LicenseMIT
This repository is not active