Reinforcement Learning

Repository dedicated to the implementation of several Reinforcement Learning methods in Python, exemplified by OpenAI environments. Some performance comparisons are included.

Currently implemented:

agentTabular.py: Tabular action value methods.
- SARSA
- SARSA(λ)
- Q-learning
- Watkins Q-learning
agentIncrementalVFA.py: Incremental methods using value function approximation.
- TD
- TD(λ)
- Gradient TD2
- Gradient Q-learning
- Recursive Least Squares TD
agentBatchVFA.py: Batch methods using value function approximation.
- Least Squares TD
- Least Squares TD(λ)
- Least Squares TDQ
- Least Squares Policy Iteration TD
agentActorCritic.py: Actor-critic methods.
- Q Actor-Critic
- Advantage Actor-Critic
- TD Actor-Critic
- TD(λ) Actor-Critic
agentModelBased.py: Model based methods.
- DynaQ
- Monte Carlo Tree Search
- TD Tree Search
- Dyna2

Several classes and functions required for the above files are contained in util.py.

The environment file gridworld.py might be required for some examples and comparisons.

DapengChalmers/RL

Reinforcement Learning