This repository contains PyTorch implementations of reinforcement learning algorithms. Its purpose is to provide straightforward and easily understandable code for individuals to learn about basic reinforcement learning algorithms.
- Deep Q Learning (DQN) (Mnih et al. 2013)
- D3QN (Wang et al. 2016)
- Deep Deterministic Policy Gradients (DDPG) (Lillicrap et al. 2016 )
- Twin Delayed Deep Deterministic Policy Gradients (TD3) (Fujimoto et al. 2018)
- Soft Actor-Critic (SAC) (Haarnoja et al. 2018)
- Soft Actor-Critic for Discrete Actions (SAC-Discrete) (Christodoulou 2019)
- Tsallis Actor-Critic (TAC) (Lee, Kyungjae, et al. 2019)
- Proximal Policy Optimisation (PPO) (Schulman et al. 2017)