psychNerdJae/learning-rl

Reinforcement learning as used in psychological research

Jupyter NotebookMIT

Learning RL

Jae's record of self-teaching reinforcement learning (RL), as it's used in psychological research. Also a record of self-teaching Python(!), so lacks any sort of the elegance of well-written Python.

Topics

Learning the value of a single stimulus (learning rate parameter alpha)
Multi-stimulus environments (Rescorla-Wagner and the prediction error delta)
Choosing between stimuli (exploration parameter beta)
Multi-step choice (temporal discounting parameter gamma)
Agents with "memory activation" (TD-lambda)
Sarsa and Q-learning
Model-based (MB) agents and transition structure
The Successor Representation (SR) and transition structure

Potential future topics

SR + Dyna replay + prioritized sweeping
SF + LSFM
Parameter-fitting

Inspiration / tutorials