Primal_Dual_RL

Comparsion between the result of our primal dual method (converges at 500 episodes) vs result of TD Actor Critic method (converges at 2000 episodes)

Graph 1 Graph 2

To run our code, simply use

Train

$ run python primal_dual_stochastic.py