xuanlinli17/CS285_Fa19_Deep_Reinforcement_Learning

My solutions to UC Berkeley CS285 (originally CS294-112, deeprlcourse) Fall 2019 assignments

Python

Issues

hw4
#10 opened 5 years ago by cometta
0
why retreive the first element of action
#9 opened 5 years ago by cometta
1
hw2: general advantage estimation
#8 opened 5 years ago by cometta
3
Reproducing the result of hw1 problem 1(b)
#3 opened 5 years ago by Duconnor
2