PyTorch implementations of popular deep reinforcement learning algorithms. Implemented algorithms:
- Deep Q-Learning (DQN)
- Asynchronous Advantage Actor-Critic (A3C)
- Deep Deterministic Policy Gradient (DDPG)
- Proximal Policy Optimization (PPO)
- Actor Critic using Kronecker-Factored Trust Region (ACKTR)
- World Models

Papers:
- Human-level control through deep reinforcement learning
- Asynchronous Methods for Deep Reinforcement Learning
- Deterministic Policy Gradient Algorithms
- Continuous control with deep reinforcement learning
- Trust Region Policy Optimization
- Proximal Policy Optimization Algorithms
- Emergence of Locomotion Behaviours in Rich Environments
- Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation
- World Models