/Randomized-Return-Decomposition

TensorFlow implementation for our paper "Learning Long-Term Reward Redistribution via Randomized Return Decomposition"

Primary LanguagePythonMIT LicenseMIT

Stargazers