Un-official lecture notes of cs294-112-fall2017. Borrow the templates from notes of cs224n-stanford.
Note: there may be tones of typos in the notes.
- Update lecture 6: Value Functions
- Update lecture 5: Actor-critic algorithm
- Update lecture 4: Policy gradient
- Update lecture 3: Introduction to Reinforcement Learning
- Update lecture 2: Imitation Learning