강화학습 Paper들을 정리합니다.
- [DQN] Playing Atari with Deep Reinforcement Learning [2013.12]
- paper : https://arxiv.org/abs/1312.5602
- github :
- [TRPO] Trust Region Policy Optimization [2015.02]
- paper : https://arxiv.org/abs/1502.05477
- github :
- [PPO] Proximal Policy Optimization Algorithms [2017.07]
- paper : https://arxiv.org/abs/1707.06347
- github :
- [Rainbow DQN] Rainbow: Combining Improvements in Deep Reinforcement Learning [2017.10]
- paper : https://arxiv.org/abs/1710.02298
- github :
- [WorldModel] World Models [2018.03]
- paper : https://arxiv.org/abs/1803.10122
- github :
- [RUDDER] RUDDER: Return Decomposition for Delayed Rewards [2018.06]
- paper : https://arxiv.org/abs/1806.07857
- github :
- [R2D2] Recurrent Experience Replay in Distributed Reinforcement Learning [2018.11]
- paper : https://openreview.net/forum?id=r1lyTjAqYX
- github :
- [RND] Exploration by random distillation [2018.11]
- paper : https://openreview.net/forum?id=H1lJJnR5Ym
- github :
the-gan-zoo Repository를 참조했습니다.
github : https://github.com/hindupuravinash/the-gan-zoo