This repository is not active
nickschuetz/llm-colosseum
Benchmark LLMs by fighting in Street Fighter 3! The new way to evaluate the quality of an LLM
Jupyter NotebookMIT
Benchmark LLMs by fighting in Street Fighter 3! The new way to evaluate the quality of an LLM
Jupyter NotebookMIT
This repository is not active