This repository is a reconstruction of the previous rl-algorithms repository. Most of the algorithms in it are foundational algorithms in recent deep reinforcement learning (click any one you want).
- python 3.8.3
- pytorch >= 1.6.0
- gym 0.18.0
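
The versions listed above can be checked before running any of the algorithms. The following is a minimal sketch, not part of the repository. It assumes the packages are installed under the distribution names `torch` and `gym`, and that any Python 3.8 or newer is acceptable. The helpers `parse_version` and `check_environment` are hypothetical names introduced here for illustration.

```python
import sys
from importlib import metadata

# (distribution name, comparison, version) taken from the list above.
# The names "torch" and "gym" are assumptions, not stated in this README.
REQUIREMENTS = [("torch", ">=", (1, 6, 0)), ("gym", "==", (0, 18, 0))]


def parse_version(text):
    """Convert a string such as '1.6.0+cu101' to a tuple such as (1, 6, 0)."""
    numbers = []
    for piece in text.split("+")[0].split(".")[:3]:
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        if not digits:
            break
        numbers.append(int(digits))
    # Pad short versions such as '0.18' so tuple comparison behaves.
    while len(numbers) < 3:
        numbers.append(0)
    return tuple(numbers)


def check_environment():
    """Return a list of problems; an empty list means nothing was flagged."""
    problems = []
    # Assumption: Python 3.8 or newer is treated as compatible with 3.8.3.
    if sys.version_info[:2] < (3, 8):
        problems.append("Python 3.8 or newer expected, found " + sys.version.split()[0])
    for name, op, wanted in REQUIREMENTS:
        try:
            found = parse_version(metadata.version(name))
        except metadata.PackageNotFoundError:
            problems.append(f"{name} is not installed")
            continue
        ok = found >= wanted if op == ">=" else found == wanted
        if not ok:
            wanted_text = ".".join(map(str, wanted))
            found_text = ".".join(map(str, found))
            problems.append(f"{name} {op} {wanted_text} expected, found {found_text}")
    return problems


if __name__ == "__main__":
    for line in check_environment() or ["environment matches the listed versions"]:
        print(line)
```

Run as a script, it prints one line per mismatch. A mismatch is only a hint that the listed versions were not reproduced exactly, not proof that the code will fail.
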

Index | Algorithm | Year | Paper URL |
---|---|---|---|
1 | DQN | 2013 | https://www.cs.toronto.edu/~vmnih/docs/dqn.pdf |
2 | DRQN | 2015 | https://arxiv.org/abs/1507.06527 |
3 | ActorCritic | -- | -- |
4 | DDPG | 2015 | https://arxiv.org/abs/1509.02971 |
5 | OptionCritic | 2016 | https://arxiv.org/pdf/1609.05140.pdf |
6 | A3C | 2016 | https://arxiv.org/abs/1602.01783 |
7 | PPO | 2017 | https://arxiv.org/abs/1707.06347 |
8 | TD3 | 2018 | https://arxiv.org/abs/1802.09477 |
9 | SAC | 2018 | https://arxiv.org/abs/1801.01290 |
10 | IQN | 2018 | https://arxiv.org/pdf/1806.06923.pdf |