Using Q-learning to teach multiple agents to escape from a small room as quickly as possible.
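The core of tabular Q-learning is a single update rule, sketched below in C. This is only an illustration, not the repository's actual code: the table sizes, the names `q_update` and `max_q`, and the idea of a per-step penalty that rewards fast escapes are all assumptions. With several agents, each one might keep its own table or all might share one; the description does not say which.

```c
#include <assert.h>

/* Hypothetical sizes: a 5x5 room and four moves (up, down, left, right). */
enum { N_STATES = 25, N_ACTIONS = 4 };

/* Largest Q-value reachable from state s. */
static double max_q(double q[N_STATES][N_ACTIONS], int s)
{
    double best = q[s][0];
    for (int a = 1; a < N_ACTIONS; ++a) {
        if (q[s][a] > best) {
            best = q[s][a];
        }
    }
    return best;
}

/* One Q-learning step:
 *   Q(s,a) += alpha * (r + discount * max_a' Q(s',a') - Q(s,a))
 * When terminal is nonzero (assumed here to mean the agent has left
 * the room), no future value is bootstrapped from s_next. */
static void q_update(double q[N_STATES][N_ACTIONS],
                     int s, int a, double r, int s_next, int terminal,
                     double alpha, double discount)
{
    assert(s >= 0 && s < N_STATES);
    assert(s_next >= 0 && s_next < N_STATES);
    assert(a >= 0 && a < N_ACTIONS);

    double target = r + (terminal ? 0.0 : discount * max_q(q, s_next));
    q[s][a] += alpha * (target - q[s][a]);
}
```

A caller would zero-initialise a `double q[N_STATES][N_ACTIONS]` table and invoke `q_update` after every move. Giving a small negative reward per step (for example -1) is one common way to make shorter escape paths score higher.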
Primary language: C. License: The Unlicense.