/rl-paper-study

Reinforcement Learning paper review study

MIT LicenseMIT

rl-paper-study

Reinforcement Learning paper review study

1st paper list

Date Paper Presenter Links
5/11 Playing Atari with Deep Reinforcement Learning, Mnih et al, 2013. Ingyun Ahn [paper] [review]
5/11 Dueling Network Architectures for Deep Reinforcement Learning, Wang et al, 2015. Jaeyoung Ahn [paper] [review]
5/25 Deep Reinforcement Learning with Double Q-learning, Hasselt et al 2015. Do-Hoon Kim [paper] [review]
5/25 Asynchronous Methods for Deep Reinforcement Learning, Mnih et al, 2016. Seungyoun Shin [paper] [review]
6/1 Continuous Control With Deep Reinforcement Learning, Lillicrap et al, 2015. Chris Ohk [paper] [review]
6/1 Mastering the game of Go with deep neural networks and tree search, D. Silver et al, Nature, 2016. Minseok Seong [paper] [review]
6/8 Curiosity-driven Exploration by Self-supervised Prediction, Pathak et al, 2017. Haneul Choi [paper] [review]
6/8 Mastering the game of Go without human knowledge, D. Silver et al, Nature, 2017. Donggu Kang [paper] [review]
6/22 Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model, J. Schrittwieser et al, 2019. Yunhyeok Kwak [paper] [review]
6/29 Contextual Decision Processes with low Bellman rank are PAC-Learnable, N. Jiang et al, 2017. Hoesung Ryu [paper] [review]
6/29 Evolution Strategies as a Scalable Alternative to Reinforcement Learning, Salimans et al, 2017. Chanhyuk Park [paper] [review]
6/29 QT-Opt: Scalable Deep Reinforcement Learning for Vision-Based Robotic Manipulation, Kalashnikov et al, 2018. Hyecheol (Jerry) Jang [paper] [review]

2nd paper list

Date Paper Presenter Links
7/27 Deep Recurrent Q-Learning for Partially Observable MDPs, M. Hausknecht et al, 2015. Minsuk Sung [paper] [review]
8/3 Hierarchical Visuomotor Control of Humanoids, J. Merel et al, 2018. Seonghyeon Moon [paper] [review]
8/10 Learning Dexterous In-Hand Manipulation, M. Andrychowicz et al, 2020. Ingyun Ahn [paper] [review]
8/10 Deep Reinforcement Learning with a Natural Language Action Space, J. He et al, 2015. Jihun Kim [paper] [review]
8/10 Program Guided Agent, SH. Sun et al, 2020. Haneul Choi [paper] [review]
8/24 Trust Region Policy Optimization, J. Schulman et al, 2015. Chris Ohk [paper] [review]
8/31 Proximal Policy Optimization Algorithms, J. Schulman et al, 2017. Chris Ohk [paper] [review]
8/31 Implementation Matters in Deep RL: A Case Study on PPO and TRPO, L. Engstrom et al, 2020. Yunhyeok Kwak [paper] [review]
9/7 Generative Adversarial Imitation Learning, J. Ho et al, 2016. Hoesung Ryu [paper] [review]
9/14 Efficient Reductions for Imitation Learning, S. Ross et al, 2010. Hyecheol (Jerry) Jang [paper] [review]
9/14 Variational Discriminator Bottleneck: Improving Imitation Learning, Inverse RL, and GANs by Constraining Information Flow, XB. Peng et al, 2018. Do-Hoon Kim [paper] [review]
9/14 Grandmaster Level in StarCraft II using Multi-agent Reinforcement Learning, O. Vinyals et al, 2019. Donggu Kang [paper] [review]

3rd paper list

Date Paper Presenter Links
11/9 Trust Region Policy Optimization, J. Schulman et al, 2015. Chris Ohk [paper] [review]
TBA Proximal Policy Optimization Algorithms, J. Schulman et al, 2017. Chris Ohk [paper] [review]
TBA Deep Reinforcement Learning with Stacked Hierarchical Attention for Text-based Games, Y. Xu et al, 2020. Chris Ohk [paper] [review]
11/16 Model based Reinforcement Learning for Atari, L. Kaiser et al, 2019. Sungdong Yoo [paper] [review]
11/16 High-dimensional Continuous Control using Generalized Advantage Estimation, J. Schulman et al, 2015. Junyeob Baek [paper] [review]
11/23 The Option-critic Architecture, PL. Bacon et al, 2017. Seonghyeon Moon [paper] [review]
11/23 Rainbow: Combining Improvements in Deep Reinforcement Learning, M. Hessel et al, 2017. Sungkwon On [paper] [review]
11/30 Prioritized Experience Replay, T. Schaul et al, 2015. Donggu Kang [paper] [review]
11/30 Soft Actor-critic: Off-policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor, T. Haarnoja et al, 2018. Hyo Jeon [paper] [review]
12/7 Distributed Prioritized Experience Replay, D. Horgan et al, 2018. Jungyeon Lee [paper] [review]
12/7 Impala: Scalable Distributed Deep-RL with Importance Weighted Actor-learner Architectures, L. Espeholt et al, 2018. Junhyung Kang [paper] [review]
12/14 A Distributional Perspective on Reinforcement Learning, MG. Bellemare et al, 2017. Wonwoo Choi [paper] [review]
12/14 Addressing Function Approximation Error in Actor-critic Methods, S. Fujimoto et al, 2018. Sooyoung Lee [paper] [review]
12/21 Action-gap Phenomenon in Reinforcement Learning, A. Farahmand et al, 2011. Handong Im [paper] [review]
12/21 Multi-agent Actor-critic for Mixed Cooperative-competitive Environments, R. Lowe et al, 2017. Juyeon Kim [paper] [review]
12/28 Second-order Optimization for Deep Reinforcement Learning using Kronecker-factored Approximation, Y. Wu et al, 2017. Youngjin Jung [paper] [review]
12/28 Evolution Strategies as a Scalable Alternative to Reinforcement Learning, T. Salimans et al, 2017. Heemok Ha [paper] [review]

Contact

You can contact me via e-mail (utilForever at gmail.com). I am always happy to answer questions or help with any issues you might have, and please be sure to share any additional work or your creations with me, I love seeing what other people are making.

License

The class is licensed under the MIT License:

Copyright (c) 2020 RL Paper Study Team

Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.