omerbsezer/Reinforcement_learning_tutorial_with_demo
Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, Q-Learning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers, Courses, etc.
Jupyter Notebook
Watchers
- Andrew-Montana (Ukraine, Odessa)
- avijit9 (Hyderabad, India)
- danielneumann
- David-WL
- eemailme
- hirakpal
- Htilil
- issifuabdulmajeed (Istanbul Turkey)
- jhcloos
- jiashi9
- Kelvinson (Somewhere)
- ksitikomariah (South Korea)
- laranea (Laranea)
- MaKaili (The Chinese University of Hong Kong)
- mc-o (University College London)
- norainagain
- omerbsezer (Germany)
- paper2code-bot (@paper2code)
- Pushpesh-31
- quickresolve (Quick Resolve)
- shanshanlaichi54
- somy1997
- spinynormal
- TaoYang2015
- wlvh
- zyongbo