Jae's record of self-teaching reinforcement learning (RL), as it's used in psychological research. Also a record of self-teaching Python(!), so lacks any sort of the elegance of well-written Python.
- Learning the value of a single stimulus (learning rate parameter alpha)
- Multi-stimulus environments (Rescorla-Wagner and the prediction error delta)
- Choosing between stimuli (exploration parameter beta)
- Multi-step choice (temporal discounting parameter gamma)
- Agents with "memory activation" (TD-lambda)
- Sarsa and Q-learning
- Model-based (MB) agents and transition structure
- The Successor Representation (SR) and transition structure
- SR + Dyna replay + prioritized sweeping
- SF + LSFM
- Parameter-fitting