mpatacchiola/dissecting-reinforcement-learning
Python code, PDFs and resources for the series of posts on Reinforcement Learning which I published on my personal blog
PythonMIT
Issues
- 0
Part 3, TD(lambda): trace_matrix should be reset to zeroes at the beginning of each epoch
#22 opened by johanwiden - 1
Part.1 Modified Policy Iteration with Simplified Bellman Equation and Linear Algebra Policy Evaluation Infinite Loop
#20 opened by CesarAndresRojas - 4
mdp linear algebra approach cannot stop
#6 opened by zdarktknight - 0
Missing brackets
#18 opened by DoDzilla-ai - 0
Print statement causing issue in Python 3.x
#15 opened by DoDzilla-ai - 1
Two undefined variables
#16 opened by DoDzilla-ai - 2
The clean robot example on chapter 1 ?
#14 opened by ngthanhtin - 2
11X11 grid
#13 opened by Andlibmehndi - 2
about greedy agent in multi-armed bandit
#12 opened by ZichaoHuang - 1
Looking forward to post #8
#5 opened by BKJackson - 2
- 1
- 1
- 1
Alternative to Numpy
#1 opened by abencomo