- Deep Q-Network (DQN)
- Double DQN (DDQN)
- Advantage Actor-Critic (A2C)
- Asynchronous Advantage Actor-Critic (A3C)
- Deep Deterministic Policy Gradient (DDPG)
- Truncated Natural Policy Gradient (TNPG)
- Trust Region Policy Optimization (TRPO)
- Generalized Advantage Estimator (GAE)
- Proximal Policy Optimization (PPO)
- Soft Actor-Critic (SAC)
- Apprenticeship Learning via Inverse Reinforcement Learning (APP)
- Maximum Entropy Inverse Reinforcement Learning (MaxEnt)
- Generative Adversarial Imitation Learning (GAIL)
- Variational Adversarial Imitation Learning (VAIL)
dongminlee94/Reinforcement-Learning-Code
A repository for code of reinforcement learning algorithms with PyTorch
PythonMIT