RLStuff A collection of me playing around with Reinforcement Learning and other stuff. Implemented Algorithms Q Learning Deep Q Learning REINFORCE REINFORCE with Baseline Actor-Critic TD(0) Actor-Critic Forward-view TD(λ) Actor-Critic Backward-view TD(λ) Off-Policy Actor Critic Compatible Off-Policy Deterministic Actor-Critic with Q Critic Deep Deterministic Policy Gradient Proximal Policy Optimization Interesting papers Deep Reinforcement Learning that Matters Implemmentation Matters in Deep Policy Gradients: A Case Study on PPO and TRPO What Matters In On-Policy Reinforcement Learning? A Large-Scale Empirical Study