/BipedalWalker-ReinforcementLearning

In this project I train an agent for the BipedalWalker environment using the Proximal Policy Optimization (PPO) algorithm from the Stable-Baselines3 library. The agent is trained to navigate the environment, which simulates a robot with two legs.
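To give a feel for what PPO optimizes, here is a minimal pure-Python sketch of its clipped surrogate objective. It is not the code from this repository or from the library, which works on batched tensors and adds value-function and entropy terms. The function names and the `clip_range` default of 0.2 are assumptions chosen for illustration.

```python
from typing import Sequence


def clipped_surrogate(ratio: float, advantage: float, clip_range: float = 0.2) -> float:
    """Per-sample PPO clipped surrogate objective (a quantity to maximize).

    ratio      -- new_policy_prob / old_policy_prob for the action taken
    advantage  -- estimated advantage of that action
    clip_range -- how far the ratio may move from 1 before it is clipped
                  (illustrative default, an assumption of this sketch)
    """
    # Restrict the probability ratio to [1 - clip_range, 1 + clip_range].
    clipped_ratio = max(1.0 - clip_range, min(ratio, 1.0 + clip_range))
    # Take the more pessimistic of the unclipped and clipped estimates.
    return min(ratio * advantage, clipped_ratio * advantage)


def mean_clipped_surrogate(
    ratios: Sequence[float],
    advantages: Sequence[float],
    clip_range: float = 0.2,
) -> float:
    """Average the per-sample objective over a batch of transitions."""
    values = [clipped_surrogate(r, a, clip_range) for r, a in zip(ratios, advantages)]
    return sum(values) / len(values)


if __name__ == "__main__":
    # A good action (positive advantage) whose probability rose by 50%:
    # the gain is capped at the clipped ratio of 1.2.
    print(clipped_surrogate(1.5, 1.0))
    # A bad action (negative advantage) whose probability rose by 50%:
    # the penalty is not clipped, so the full -1.5 is kept.
    print(clipped_surrogate(1.5, -1.0))
```

The clipping removes the incentive to push the policy far from the one that collected the data in a single update, which is the main reason PPO tends to train stably on continuous-control tasks such as a walking robot.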

Primary Language: Jupyter Notebook
