SuerpX/Embedded-Self-Predictions
We investigate a deep reinforcement learning (RL) architecture that supports explaining why a learned agent prefers one action over another.
Jupyter Notebook