Training a simple AI using the policy-gradient approach to Reinforcement Learning.
Primary Language: Jupyter Notebook
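
As a rough illustration of what the policy-gradient approach involves, here is a minimal REINFORCE-style sketch on a two-armed bandit. It is not taken from the repository's notebooks: the environment, the function names (`softmax`, `train_bandit`), the reward probabilities, and the hyperparameters are all assumptions chosen for brevity. The policy is a softmax over one preference per action, and each preference is nudged along the gradient of the log-probability of the chosen action, scaled by how much the reward exceeded a running-average baseline.

```python
import math
import random


def softmax(prefs):
    """Turn action preferences into a probability distribution."""
    m = max(prefs)  # subtract the max for numerical stability
    exps = [math.exp(p - m) for p in prefs]
    total = sum(exps)
    return [e / total for e in exps]


def train_bandit(reward_probs, steps=5000, lr=0.1, seed=0):
    """REINFORCE on a hypothetical Bernoulli bandit.

    reward_probs[i] is the (assumed) chance that action i pays reward 1.
    Returns the learned action probabilities.
    """
    rng = random.Random(seed)
    prefs = [0.0] * len(reward_probs)  # policy parameters, one per action
    baseline = 0.0                     # running mean reward, reduces variance
    for t in range(1, steps + 1):
        probs = softmax(prefs)
        # Sample an action from the current policy.
        action = rng.choices(range(len(prefs)), weights=probs)[0]
        reward = 1.0 if rng.random() < reward_probs[action] else 0.0
        advantage = reward - baseline
        baseline += (reward - baseline) / t
        # For a softmax policy, d log pi(action) / d prefs[i]
        # equals (1 if i == action else 0) - probs[i].
        for i in range(len(prefs)):
            grad_log = (1.0 if i == action else 0.0) - probs[i]
            prefs[i] += lr * advantage * grad_log
    return softmax(prefs)


final_probs = train_bandit([0.2, 0.8])
print(final_probs)
```

With these assumed settings the learned policy should come to favor the second action, since it pays out more often. A full agent would replace the bandit with a multi-step environment and the preference table with a neural network, but the update rule keeps the same shape.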