This repository contains from-scratch PyTorch implementations of some of the most widely used Reinforcement Learning algorithms. The project focuses on implementing the algorithms in a clean and readable way.
The project is still under development. Currently available algorithms:
- Policy Gradient, no baseline (PG)
- Policy Gradient, value baseline (PG)
- Deep Q Network (DQN)
- Deep Deterministic Policy Gradient (DDPG)
- Soft Actor Critic (SAC)
- Proximal Policy Optimization (PPO)
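
The two Policy Gradient entries differ in what weights each action's log-probability: the discounted return alone, or the return minus a value estimate. The following is a minimal plain-Python sketch of that difference, not code taken from this repository; the function names `discounted_returns` and `advantages` and the default `gamma` are hypothetical and chosen only for illustration.

```python
def discounted_returns(rewards, gamma=0.99):
    """Compute the reward-to-go G_t = r_t + gamma * G_{t+1} for every step.

    `rewards` is the list of rewards from one episode (illustrative input).
    """
    returns = []
    running = 0.0
    # Walk the episode backwards so each step can reuse the next step's return.
    for r in reversed(rewards):
        running = r + gamma * running
        returns.append(running)
    returns.reverse()
    return returns


def advantages(returns, baselines=None):
    """Weights for the log-probability terms of a policy gradient update.

    Without a baseline the raw returns are used; with a value baseline,
    the per-step value estimates are subtracted from the returns.
    """
    if baselines is None:
        return list(returns)
    return [g - b for g, b in zip(returns, baselines)]
```

In a PyTorch training loop these weights would typically be converted to a tensor and multiplied with the negative log-probabilities of the taken actions; subtracting a baseline is commonly used to reduce the variance of that estimate.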