ShangtongZhang/reinforcement-learning-an-introduction

Python Implementation of Reinforcement Learning: An Introduction

Python · MIT

Issues

The policy of chapter 1
#153 opened 3 years ago by benroo123
1
Citing this repository
#162 opened 2 years ago by RensOliemans
0
Chapter 2: Couldn't find the file '../images/figure_2_1.png'
#161 opened 2 years ago by Zhangxiaoyi688
0
Chapter 2: reset time
#112 opened 5 years ago by sursu
2
chapter4 gamblers_problem, showing multiple best actions
#158 opened 2 years ago by itschenxi
0
ch06 random_walk td method
#157 opened 2 years ago by Perseus1993
1
Unclear point for the code in Blackjack example
#155 opened 3 years ago by eatam
1
l
#156 opened 2 years ago by Karp8841
0
problem about chapter04/car_rental.py
#150 opened 3 years ago by shaoeChen
1
Wrong Bellman equation for Jack's car rental problem?
#154 opened 3 years ago by Raymondliz
1
Problem with exercise 2.5
#152 opened 3 years ago by qiqiJiang-st
0
typo
#146 opened 3 years ago by arashHaratian
0
example to use it on human genetic data?
#151 opened 3 years ago by Shicheng-Guo
0
Why doesn't figure2_3 in ten_armed_testbed.py use "sample_averages"?
#149 opened 3 years ago by A-Pai
0
Generalization to abstract classes for Environment/Agents?
#141 opened 4 years ago by chicotobi
2
wrong figure number for chapter 11
#147 opened 4 years ago by arashHaratian
0
tictactoe compete() plays 1000 almost identical games
#145 opened 4 years ago by gsverhoeven
1
something wrong in matplotlib
#139 opened 4 years ago by FYYFU
2
A simpler draw function
#135 opened 4 years ago by rohitdavas
2
nit: chapter 6 references
#136 opened 4 years ago by mahiuchun
0
Unable to get the same results while formulating differently
#134 opened 4 years ago by rohitdavas
1
No related package on the zip file
#133 opened 4 years ago by leiyongxiang1205
1
Help on ten_armed_testbed.py
#128 opened 5 years ago by ai4pharma
3
Chapter4, gambler problem
#127 opened 5 years ago by 07hyx06
1
Chapter 11
#126 opened 5 years ago by mattgithub1919
12
chapter04/car_rental_synchronous.py: the table needs to be flipped.
#124 opened 5 years ago by QuangTran4810
1
A little confused about chapter5/blackjack.py
#122 opened 5 years ago by ChenHuaYou
2
chap1/tic_tac_toc.py: why is td_error made zero when exploring?
#125 opened 5 years ago by GarfieldF
1
chapter06/random_walk.py
#123 opened 5 years ago by ChenHuaYou
1
Reinforcement learning
#120 opened 5 years ago by palbha
1
chapter04/gamblers_problem.py lines 33 to 62 may have a problem
#121 opened 5 years ago by ChenHuaYou
2
discount factor for Chapter 10
#118 opened 5 years ago by roachsinai
1
Tile Coding scaling issue
#116 opened 5 years ago by MJeremy2017
2
Misunderstanding in chapter 2
#117 opened 5 years ago by zZthebreakerZz
1
How to formulate a problem where the state is a combination of multiple factors?
#114 opened 5 years ago by MJeremy2017
1
Chapter 4: seems to be missing self. before TRUNCATE
#113 opened 5 years ago by ZiqiChai
1
epsilon not initialized
#104 opened 6 years ago by abhinavsagar
1
Maybe a little bug in chapter5 blackjack.py function 'play', lines 81-85
#103 opened 6 years ago by Huixxi
1
Question about batch_updating function in chapter06/random_walk.py
#100 opened 6 years ago by hitblackjack
1
Chapter 09: Random Walk 100
#101 opened 6 years ago by xenomeno
1
Would it be OK to publish solutions to the programming exercises alongside mainly the algorithms I intend to implement from the book?
#99 opened 6 years ago by brancoliticus
1
Missing parameter description for true_reward
#98 opened 6 years ago by michaelshiyu
2
Why not use true online Sarsa(λ) in figure 12.11?
#94 opened 6 years ago by xingE650
1
Just a Thank you note
#96 opened 6 years ago by wassimseif
0
Chapter 4 jacks car rental
#93 opened 6 years ago by HareshKarnan
2
_
#92 opened 6 years ago by hitblackjack
1
Problem with how the TD and MC methods update the last state value in an MRP
#91 opened 6 years ago by xingE650
1
chapter2_content.tex exercise 2.3 problem
#89 opened 6 years ago by RocStone
1
Some revision suggestions in Maximization_bias's Problem
#86 opened 6 years ago by LBAWMY
1
Q-learning Example Has No @expected
#84 opened 6 years ago by LinaeSostra
1