/GamblersProblem

Reinforcement learning algorithms (policy iteration and value iteration) for the gambler's problem.
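As a rough illustration of what value iteration looks like for this problem, here is a minimal Julia sketch. It is not taken from the repository. It assumes the common textbook setup: a goal of 100, a heads probability of 0.4, a reward of 1 only on reaching the goal, and no discounting. The names `value_iteration`, `greedy_policy`, `goal`, `p_heads`, and `tol` are hypothetical.

```julia
# Sketch of value iteration for the gambler's problem.
# Assumed setup (not from the repository): goal = 100, P(heads) = 0.4,
# reward 1 only on reaching the goal, no discounting.

function value_iteration(; goal::Int = 100, p_heads::Float64 = 0.4, tol::Float64 = 1e-10)
    # V[s + 1] is the value of holding capital s, for s = 0..goal (1-based arrays).
    V = zeros(goal + 1)
    V[goal + 1] = 1.0  # encodes the reward of 1 for reaching the goal; capital 0 stays at 0

    while true
        delta = 0.0
        for s in 1:goal-1
            best = 0.0
            # A stake can be at most the current capital or the amount still needed.
            for stake in 1:min(s, goal - s)
                q = p_heads * V[s + stake + 1] + (1 - p_heads) * V[s - stake + 1]
                best = max(best, q)
            end
            delta = max(delta, abs(best - V[s + 1]))
            V[s + 1] = best  # in-place update
        end
        delta < tol && break
    end
    return V
end

# Pick, for each capital 1..goal-1, a stake that maximizes the expected value.
# Ties are broken toward the smallest stake because argmax returns the first maximum.
function greedy_policy(V; goal::Int = 100, p_heads::Float64 = 0.4)
    policy = zeros(Int, goal - 1)
    for s in 1:goal-1
        stakes = 1:min(s, goal - s)
        q = [p_heads * V[s + a + 1] + (1 - p_heads) * V[s - a + 1] for a in stakes]
        policy[s] = stakes[argmax(q)]
    end
    return policy
end

V = value_iteration()
policy = greedy_policy(V)
```

Under these assumptions `V[s + 1]` approximates the probability of reaching the goal from capital `s`. Policy iteration would alternate a full evaluation of a fixed policy with a greedy improvement step like `greedy_policy` instead of taking the maximum inside each sweep.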

Primary Language: Julia
