Custom PyTorch implementations of deep RL algorithms from OpenAI's Spinning Up series.

Algorithms, in increasing order of complexity:
- Vanilla Policy Gradient
- Trust Region Policy Optimization
- Proximal Policy Optimization
- Deep Deterministic Policy Gradient
- Twin Delayed DDPG
- Soft Actor-Critic