Reinforcement learning algorithms (policy iteration and value iteration) for the gambler's problem.
Primary language: Julia
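
To illustrate the value iteration half of the description, here is a minimal sketch. It is not the repository's own code. It assumes the standard textbook setup of the gambler's problem (Sutton and Barto, Example 4.3): a goal of 100, a heads probability of 0.4, and no discounting. The function names `gambler_value_iteration` and `greedy_policy`, and the keyword parameters `goal`, `p_h`, and `theta`, are hypothetical names chosen for this example.

```julia
# Value iteration for the gambler's problem.
# Assumed setup (not taken from the repository): goal = 100, p_h = 0.4, undiscounted.

function gambler_value_iteration(; goal::Int = 100, p_h::Float64 = 0.4,
                                 theta::Float64 = 1e-10)
    # V[s + 1] holds the value of capital s, for s = 0, ..., goal.
    V = zeros(goal + 1)
    # Fixing the goal state's value at 1 is equivalent to a reward of +1
    # on reaching the goal; capital 0 (ruin) keeps value 0.
    V[goal + 1] = 1.0
    while true
        delta = 0.0
        for s in 1:(goal - 1)
            best = 0.0
            # A stake a wins with probability p_h and loses otherwise.
            for a in 1:min(s, goal - s)
                q = p_h * V[s + a + 1] + (1 - p_h) * V[s - a + 1]
                best = max(best, q)
            end
            delta = max(delta, abs(best - V[s + 1]))
            V[s + 1] = best  # in-place (Gauss-Seidel style) sweep
        end
        delta < theta && break
    end
    return V
end

# Extract a greedy policy from V: for each capital s, the smallest stake
# whose action value is within a small tolerance of the maximum.
function greedy_policy(V::Vector{Float64}; goal::Int = length(V) - 1,
                       p_h::Float64 = 0.4)
    policy = zeros(Int, goal - 1)
    for s in 1:(goal - 1)
        best_q, best_a = -Inf, 0
        for a in 1:min(s, goal - s)
            q = p_h * V[s + a + 1] + (1 - p_h) * V[s - a + 1]
            if q > best_q + 1e-9
                best_q, best_a = q, a
            end
        end
        policy[s] = best_a
    end
    return policy
end

V = gambler_value_iteration()
policy = greedy_policy(V)
```

Here `V[s + 1]` estimates the probability of reaching the goal from capital `s` under optimal play, and `policy[s]` is one optimal stake. Many stakes tie in this problem, so the policy that comes out depends on the tie-breaking tolerance.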