/Introduction-to-Policy-Based-Methods-for-Reinforcement-Learning

This module looks at policy-based methods for reinforcement learning, focusing on the drawbacks of value-based methods such as Q-learning that motivate the use of policy gradients.

Primary language: Jupyter Notebook. License: MIT.

Introduction to Policy Based Methods for Reinforcement Learning
