/goller

Reinforcement Learning methods in Go for optimal control via generalized policy iteration.

Primary LanguageGoGNU General Public License v2.0GPL-2.0

goller

Reinforcement Learning methods in Go for optimal control via generalized policy iteration.

Example

greedy := policy.EpisilonGreedy(eps)
learner := learn.WithPolicy(greedy, behavior).Q(learnRate)