Implementation of the Options framework using the Q-learning algorithm.
Based on the paper "Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning" (Sutton, Precup, and Singh, 1999).
- NumPy
- Matplotlib
- Gym
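
For readers new to the Options framework, the sketch below shows the SMDP Q-learning update described in the paper, applied once an option has finished executing. It is a minimal illustration and not necessarily how this repository structures its code: the function names `discounted_return` and `smdp_q_update`, the tabular `q` array, and the hyperparameter values are all assumptions made for the example.

```python
import numpy as np


def discounted_return(rewards, gamma):
    """Discounted sum of the rewards collected while one option was running."""
    return sum(gamma ** k * r for k, r in enumerate(rewards))


def smdp_q_update(q, state, option, option_return, next_state, duration,
                  alpha=0.1, gamma=0.99):
    """Apply one SMDP Q-learning update to a tabular Q(state, option) array.

    `option_return` is the discounted reward accumulated over the `duration`
    primitive steps the option ran before terminating in `next_state`.
    (Hypothetical helper, written for illustration only.)
    """
    # Bootstrap from the best option in the state where the option terminated,
    # discounted by the number of primitive steps it took to get there.
    target = option_return + gamma ** duration * np.max(q[next_state])
    q[state, option] += alpha * (target - q[state, option])
    return q


# Example: 3 states, 2 options; option 1 runs for 3 steps from state 0 to state 2.
q = np.zeros((3, 2))
ret = discounted_return([0.0, 0.0, 1.0], gamma=0.9)
q = smdp_q_update(q, state=0, option=1, option_return=ret,
                  next_state=2, duration=3, alpha=0.5, gamma=0.9)
```

The only difference from one-step Q-learning is the `gamma ** duration` factor, which accounts for the variable number of primitive steps an option takes before it terminates.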