/SmartPlay

SmartPlay is a benchmark for Large Language Models (LLMs). Uses a variety of games to test various important LLM capabilities as agents. SmartPlay is designed to be easy to use, and to support future development of LLMs.

Primary LanguagePythonCreative Commons Attribution 4.0 InternationalCC-BY-4.0

Watchers

No one’s watching this repository yet.