Contents of the sections Textbook by Sutton and Bart
Lesson: Temporal-Difference Methods.
Lesson: Solve OpenAI Gym's Taxi-v2 Task
Lesson: RL in Continuous Spaces
- Discretization
- Tile Coding
- Coarse Coding
- Function Approximation
Deep RL
- Learning to see and act
- Video: Deep Q-Learning
- Human-level control through deep reinforcement learning
Deep RL for Robotics
- the Japanese robot company Fanuc
- Robot learns via trial and error like a human
- DEEP LEARNING IN PRODUCTION & WAREHOUSING WITH AMAZON ROBOTICS
Opportunities
Deep Q-networks (DQN)
- Experience Relay: A Deeper Look at Experience Replay
- Fixed Q Targets:
- Doueble DQN: Deep Reinforcement Learning with Double Q-learning
- Prioritized Experience Replay: Prioritized Experience Replay
- Some transition information should be sampled.
- Dueling Network Dueling Network Architectures for Deep Reinforcement Learning
- Pros:
Some other proposed methods