CS 294-112 | Deep Reinforcement Learning Fall 2018 - Assignment Solutions

My own solutions for Cs294-112

Project1

Behavioral Cloning vs DAgger

I was able to get the results below with given hyperparameter.

Learning Curves

Hopper-v2

Reacher-v2

Agents with huge improvements in DAgger have shown soaring loss function in learning curves.

Policy Gradient Method in discrete action space and continous action space

FrozenLake-v2

HalfCheetah-v2