LunarLander-ReinforcementLearning

In this project, I trained an agent with the PPO (Proximal Policy Optimization) algorithm from Stable-Baselines3 to land the spacecraft in the LunarLander environment. The agent learns by reinforcement learning, adjusting its policy to maximize the reward it collects per episode. The resulting model achieved a high level of success in the LunarLander environment.
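A minimal sketch of what such a training run might look like is shown here. It is not the notebook's actual code: the environment ID, the timestep budget, the number of evaluation episodes, and the helper names `train_and_evaluate` and `is_solved` are all assumptions made for illustration. The 200-point threshold is the score at which the Gym documentation considers a LunarLander episode a solution.

```python
# Illustrative sketch only; the values below are assumptions, not the project's settings.
ENV_ID = "LunarLander-v2"    # assumed ID; Gymnasium 1.0+ registers "LunarLander-v3" instead
TOTAL_TIMESTEPS = 200_000    # assumed training budget
SOLVED_THRESHOLD = 200.0     # score at which an episode counts as a solution


def is_solved(mean_reward, threshold=SOLVED_THRESHOLD):
    """Return True if the mean evaluation reward reaches the solved threshold."""
    return mean_reward >= threshold


def train_and_evaluate(env_id=ENV_ID, total_timesteps=TOTAL_TIMESTEPS, n_eval_episodes=10):
    """Train a PPO agent and return (model, mean_reward, std_reward).

    Hypothetical helper. Requires stable-baselines3 and gymnasium with Box2D support.
    """
    # Imported inside the function so the module loads without the RL stack installed.
    import gymnasium as gym
    from stable_baselines3 import PPO
    from stable_baselines3.common.evaluation import evaluate_policy

    env = gym.make(env_id)
    # "MlpPolicy" is a feed-forward network policy, suited to the vector observations here.
    model = PPO("MlpPolicy", env, verbose=1)
    model.learn(total_timesteps=total_timesteps)

    # Evaluate on the (monitored, vectorized) environment that PPO wrapped internally.
    mean_reward, std_reward = evaluate_policy(
        model, model.get_env(), n_eval_episodes=n_eval_episodes
    )
    env.close()
    return model, mean_reward, std_reward
```

Calling `train_and_evaluate()` and passing the returned mean reward to `is_solved` gives a rough check of whether the trained agent lands reliably. The model can then be stored with `model.save(...)` for later use.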

Primary language: Jupyter Notebook

LunarLander-ReinforcementLearning

Creating an agent that can land the lunar lander.

LunarLander.mp4