Update ToySGD
benjamc opened this issue · 3 comments
benjamc commented
- log -> log10
- reward has high negative values soon --> too easy?
benjamc commented
Reward should have positive values. High negative values bc of momentum bug.
Both points adressed in #126
benjamc commented
- feed coefficients in reverse order to numpy Polynomial and recreate instance sets