MeanQ

Code base for the paper "Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks".

Many implementation details of this project are adapted from SUNRISE (https://github.com/pokaxpoka/sunrise). Thanks, Kimin!