cho3/ReinforcementLearning.jl

Something that tries to tie together various online model-free methods into a vaguely extensible and sensible API

Jupyter Notebook

Issues

How to elegantly handle both continuous and discrete action spaces
#3 opened 9 years ago by cho3
1
Break up policy.jl
#5 opened 9 years ago by cho3
0
How to handle different kinds of learning rate annealing and make it consistent with different update methods
#4 opened 9 years ago by cho3
0
Figure out exploration policies
#1 opened 9 years ago by cho3
1
Figure out how to elegantly handle both feature functions that take states and actions and those that take just states
#2 opened 9 years ago by cho3
0