xuanlinli17/CS285_Fa19_Deep_Reinforcement_Learning
My solutions to UC Berkeley CS285 (originally CS294-112, deeprlcourse) Fall 2019 assignments
Python
Issues
- 0
- 1
why retrieve the first element of action
#9 opened by cometta - 3
hw2: generalized advantage estimation
#8 opened by cometta - 2
Reproducing the result of hw1 problem 1(b)
#3 opened by Duconnor