MichaelFish199/BipedalWalker-ReinforcementLearning
In this project I create agent for the BipedalWalker environment using the Proximal Policy Optimization (PPO) algorithm from the stablebaselines3 library. The agent is trained to navigate the BipedalWalker environment, which is a simulated robot with two legs.
Jupyter Notebook
Stargazers
No one’s star this repository yet.