rl-fundamentals

Contains RL techniques applied to various domains. The list will be updated frequently.

  1. Value function estimation from a given policy. Domain: Blackjack
  2. Policy iteration. Domain: Blackjack