/Transfer-Learning-for-Reinforcement-Learning

a multi-threaded AlphaGo-style agent to iteratively learn from self-play data using Monte-Carlo Tree Search algorithm

Primary LanguagePythonMIT LicenseMIT

Watchers