ShangtongZhang/reinforcement-learning-an-introduction
Python Implementation of Reinforcement Learning: An Introduction
Python, MIT
Issues
- 1
The policy of chapter 1
#153 opened by benroo123 - 0
Citing this repository
#162 opened by RensOliemans - 0
- 2
Chapter 2: reset time
#112 opened by sursu - 0
- 1
ch06 random_walk td method
#157 opened by Perseus1993 - 1
Unclear point for the code in Blackjack example
#155 opened by eatam - 0
- 1
problem about chapter04/car_rental.py
#150 opened by shaoeChen - 1
- 0
Problem with exercise 2.5
#152 opened by qiqiJiang-st - 0
typo
#146 opened by arashHaratian - 0
example to use it on human genetic data?
#151 opened by Shicheng-Guo - 0
Why doesn't figure2_3 in ten_armed_testbed.py use "sample_averages"?
#149 opened by A-Pai - 2
- 0
wrong figure number for chapter 11
#147 opened by arashHaratian - 1
- 2
something wrong in matplotlib
#139 opened by FYYFU - 2
A simpler draw function
#135 opened by rohitdavas - 0
nit: chapter 6 references
#136 opened by mahiuchun - 1
- 1
No related package on the zip file
#133 opened by leiyongxiang1205 - 3
Help on ten_armed_testbed.py
#128 opened by ai4pharma - 1
Chapter4, gambler problem
#127 opened by 07hyx06 - 12
Chapter 11
#126 opened by mattgithub1919 - 1
- 2
A little confused about chapter5/blackjack.py
#122 opened by ChenHuaYou - 1
- 1
chapter06/random_walk.py
#123 opened by ChenHuaYou - 1
Reinforcement learning
#120 opened by palbha - 2
- 1
discount factor for Chapter 10
#118 opened by roachsinai - 2
Tile Coding scaling issue
#116 opened by MJeremy2017 - 1
Misunderstanding in chapter 2
#117 opened by zZthebreakerZz - 1
- 1
Chapter 4: seems to be missing self. before TRUNCATE
#113 opened by ZiqiChai - 1
epsilon not initialized
#104 opened by abhinavsagar - 1
- 1
- 1
Chapter 09: Random Walk 100
#101 opened by xenomeno - 1
Would it be OK to publish solutions to the programming exercises alongside mainly the algorithms I intend to implement from the book?
#99 opened by brancoliticus - 2
- 1
- 0
Just a Thank you note
#96 opened by wassimseif - 2
Chapter 4 jacks car rental
#93 opened by HareshKarnan - 1
_
#92 opened by hitblackjack - 1
Problem with how the TD and MC methods update the last state value in an MRP
#91 opened by xingE650 - 1
chapter2_content.tex exercise 2.3 question
#89 opened by RocStone - 1
- 1
Q-learning Example Has No @expected
#84 opened by LinaeSostra