/pong_RL

Training a simple AI using the policy-gradient approach to Reinforcement Learning.
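As a rough illustration of what a policy-gradient update involves, here is a minimal REINFORCE-style sketch for a two-action logistic policy. It is not taken from this repository's notebook: the function names (`discounted_returns`, `reinforce_step`), the feature representation, and the default values for `gamma` and `lr` are all assumptions chosen for the example.

```python
import math


def discounted_returns(rewards, gamma=0.99):
    """Compute G_t = r_t + gamma * G_{t+1} for every step of one episode."""
    returns = [0.0] * len(rewards)
    running = 0.0
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        returns[t] = running
    return returns


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def reinforce_step(weights, episode, gamma=0.99, lr=0.1):
    """Apply one REINFORCE update to a logistic (two-action) policy.

    episode is a list of (features, action, reward) tuples, with action
    in {0, 1}. This layout is hypothetical, chosen only for illustration.
    """
    rewards = [reward for _, _, reward in episode]
    returns = discounted_returns(rewards, gamma)
    new_weights = list(weights)
    for (features, action, _), g in zip(episode, returns):
        # Probability of choosing action 1 under the current policy.
        p_one = sigmoid(sum(w * x for w, x in zip(weights, features)))
        # For a logistic policy, grad of log pi(action) w.r.t. the weights
        # is (action - p_one) * features; scale it by the return g.
        for i, x in enumerate(features):
            new_weights[i] += lr * g * (action - p_one) * x
    return new_weights
```

Actions that were followed by a high discounted return become more likely after the update, and actions followed by a low or negative return become less likely. A practical Pong agent would typically replace the linear policy with a neural network and normalize the returns, but the update has the same shape.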

Primary Language: Jupyter Notebook
