/lunarlander-ppo-clip

PPO Clip first-order method for the LunarLander discrete environment

Primary LanguageJupyter Notebook

PPO Clip for LunarLander

This repository showcases the implementation of a PPO Clip first-order method to solve the LunarLander discrete environment.

Rewards

image

Results

descarga (1) (1)