Mingcong-Cao/PDMORL-Preference-Driven-Multi-Objective-Reinforcement-Learning-Algorithm-JAX
A novel preference-driven multi-objective reinforcement learning algorithm using a single policy network that covers the entire preference space in a given domain.
Python, MIT License