ddpg-lunar-lander: A Jupyter Notebook repository from sn2727

DDPG

Untrained actor vs. trained actor

This repository contains an implementation of Deep Deterministic Policy Gradient (DDPG), a model-free reinforcement learning algorithm for continuous action spaces that uses an actor-critic architecture.
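As a rough illustration of two standard DDPG ingredients, the sketch below shows the critic's bootstrapped target and the soft (Polyak) update of the target networks. It is a minimal plain-Python sketch, not the notebook's code: the function names `td_target` and `soft_update`, the flat parameter lists, and the default values of `gamma` and `tau` are all assumptions chosen for illustration.

```python
def td_target(reward, next_q, done, gamma=0.99):
    """Critic regression target: y = r + gamma * (1 - done) * Q'(s', mu'(s')).

    `next_q` stands for the target critic's value of the next state under
    the target actor's action (hypothetical input, computed elsewhere).
    """
    return reward + gamma * (1.0 - float(done)) * next_q


def soft_update(target_params, online_params, tau=0.005):
    """Polyak averaging: theta_target <- tau * theta + (1 - tau) * theta_target.

    Parameters are flat lists of floats here purely for illustration.
    """
    return [tau * w + (1.0 - tau) * w_t
            for w, w_t in zip(online_params, target_params)]
```

In a full implementation the critic would be trained to regress onto `td_target`, the actor would be updated to increase the critic's value of its own actions, and `soft_update` would be applied to both target networks after each training step.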

The notebook ddpg.ipynb implements DDPG and explains each step. The resulting agent is then used to safely land the Lunar Lander from the Gymnasium environments.
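For orientation, an episode rollout against a Gymnasium-style environment might look like the sketch below. It is not taken from the notebook: `run_episode` and `demo` are hypothetical helpers, the random policy merely stands in for a trained actor, and the environment id `"LunarLander-v3"` with `continuous=True` assumes a recent Gymnasium release with the Box2D extras installed (older releases use `"LunarLander-v2"`).

```python
def run_episode(env, policy, max_steps=1000):
    """Roll out one episode and return the undiscounted sum of rewards."""
    obs, _info = env.reset()
    total_reward = 0.0
    for _ in range(max_steps):
        action = policy(obs)
        # Gymnasium's step() returns a 5-tuple with separate end-of-episode flags.
        obs, reward, terminated, truncated, _info = env.step(action)
        total_reward += float(reward)
        if terminated or truncated:
            break
    return total_reward


def demo():
    """Run one episode with random actions (assumes gymnasium[box2d])."""
    import gymnasium as gym  # imported lazily so run_episode has no hard dependency

    env = gym.make("LunarLander-v3", continuous=True)  # assumed environment id
    # Random actions stand in for the trained actor's output.
    episode_return = run_episode(env, lambda obs: env.action_space.sample())
    env.close()
    return episode_return
```

Calling `demo()` gives a baseline return for an untrained, random policy; swapping the lambda for a trained actor's forward pass would evaluate that actor under the same loop.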
