/rl_crawler

Train a 4 legged crawling agent to walk using Proximal Policy Optimization(PPO)

Primary LanguageJupyter NotebookMIT LicenseMIT

Watchers