/Embedded-Self-Predictions

We investigate a deep reinforcement learning (RL) architecture that supports explaining why a learned agent prefers one action over another.

Primary LanguageJupyter Notebook