- Neural Arithmetic Logic Units (NALU) https://arxiv.org/abs/1808.00508
- Dyna-Q http://incompleteideas.net/book/the-book-2nd.html
- Evolving Neural Networks through Augmenting Topologies (NEAT) http://nn.cs.utexas.edu/downloads/papers/stanley.ec02.pdf
- Human-level control through deep reinforcement learning (DQN) https://storage.googleapis.com/deepmind-media/dqn/DQNNaturePaper.pdf
- Proximal Policy Optimization Algorithms (PPO) https://arxiv.org/abs/1707.06347