harwiltz/distributional-superiority

Author implementation of DSUP(q) algorithms from the NeurIPS 2024 paper "Action Gaps and Advantages in Continuous-Time Distributional Reinforcement Learning"

PythonMIT

Stargazers

No one’s star this repository yet.