Develop-Packt/Introduction-to-Policy-Based-Methods-for-Reinforcement-Learning

This module looks at policy based methods of reinforcement learning, principally the drawbacks to value based methods like Q learning that motivate the use of policy gradients.

Jupyter NotebookMIT

Introduction to Policy Based Methods for Reinforcement Learning