keon/policy-gradient

Minimal Monte Carlo Policy Gradient (REINFORCE) Algorithm Implementation in Keras

PythonMIT

Issues

Train agent process error
#6 opened 4 years ago by SZH1230456
0
Incorrect normalising of discounted rewards
#5 opened 5 years ago by tall-josh
3
Loss function/Labels for neural network used?
#4 opened 7 years ago by abhigenie92
2
Why normalize predicted probabilities?
#3 opened 7 years ago by abhigenie92
1
Minor Questions
#1 opened 7 years ago by abhigenie92
1