A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environments.
Primary LanguagePythonGNU General Public License v3.0GPL-3.0