-
SARSA, DQN
-
Policy Gradient, A2C
-
DPG, DDPG
-
TRPO
-
PPO
-
Tsallis, SAC
-
reference : https://spinningup.openai.com/en/latest/index.html
-
future work : object detection, NLP, GAN, GAIL, NDGAIL
각자 알고리즘 폴더 내에 자기이름 폴더생성하여 코드작성하기
- 예시 : DQN/dohyeong/train.py, DQN/dohyeong/model.py