Develop-Packt/Introduction-to-Policy-Based-Methods-for-Reinforcement-Learning
This module looks at policy based methods of reinforcement learning, principally the drawbacks to value based methods like Q learning that motivate the use of policy gradients.
Jupyter NotebookMIT