Primary Language: Jupyter Notebook

rl-study

Practice code for studying reinforcement learning.

Book

Contents

  1. Tabular Solution Methods

    1. Multi-arm Bandits

    2. Finite Markov Decision Processes

    3. Dynamic Programming

    4. Monte Carlo Methods

    5. Temporal-Difference Learning

    6. Eligibility Traces

    7. Planning and Learning with Tabular Methods

  2. Approximate Solution Methods

  3. Frontiers