/pql

Parallel Q-Learning: Scaling Off-policy Reinforcement Learning under Massively Parallel Simulation

Primary LanguagePythonMIT LicenseMIT

Watchers