OpenGenerativeAI/llm-colosseum

Benchmark LLMs by fighting in Street Fighter 3! The new way to evaluate the quality of an LLM

Jupyter NotebookMIT

Readme
25Issues
991Stargazers
17Watchers

Stargazers

Prev
Next

Contact site admin: Geeks.