Code samples for the Deep Reinforcement Learning Hands-On book (original repository code).

Requirements:

- Python 3.x
- PyTorch 1.x
- Gym
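
A quick way to confirm that an environment matches the requirements listed above is sketched below. It is a minimal, standard-library-only check, not part of the book's code. The distribution names `torch` and `gym` are assumptions about how PyTorch and Gym were installed, and the helper names `python_ok` and `check_requirements` are hypothetical.

```python
import sys
from importlib import metadata

# Assumed distribution names: "torch" for PyTorch, "gym" for Gym.
REQUIRED = ("torch", "gym")


def python_ok(version_info=sys.version_info):
    """Return True when the interpreter is a Python 3.x release."""
    return version_info[0] == 3


def check_requirements(names=REQUIRED):
    """Map each distribution name to its installed version, or None if missing."""
    found = {}
    for name in names:
        try:
            found[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            found[name] = None
    return found


if __name__ == "__main__":
    print("Python 3.x:", python_ok())
    for name, version in check_requirements().items():
        print(f"{name}: {version if version is not None else 'not installed'}")
```

The script only reports what it finds. Checking that the PyTorch version is actually a 1.x release is left to the reader, since version strings can carry local suffixes.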

Chapters:

- Chapter 2: OpenAI Gym
- Chapter 3: Deep Learning with PyTorch
- Chapter 4: Cross Entropy method
- Chapter 5: Tabular learning and the Bellman equation
- Chapter 6: Deep Q-Networks
- Chapter 7: DQN extensions
- Chapter 8: Stocks trading using RL
- Chapter 9: Policy Gradients: an alternative
- Chapter 10: Actor-Critic method
- Chapter 11: Asynchronous Advantage Actor-Critic
- Chapter 12: Chatbots training with RL
- Chapter 13: Web navigation
- Chapter 14: Continuous action space
- Chapter 15: Trust regions: TRPO, PPO and ACKTR
- Chapter 16: Black-box optimisation in RL
- Chapter 17: Beyond model-free: imagination
- Chapter 18: AlphaGo Zero

To start TensorBoard on the `runs` log directory:

    tensorboard --logdir runs