BinaryFiddler/arena-hard-auto
Arena-Hard-Auto: An automatic LLM benchmark.
Jupyter NotebookApache-2.0
Stargazers
No one’s star this repository yet.
Arena-Hard-Auto: An automatic LLM benchmark.
Jupyter NotebookApache-2.0
No one’s star this repository yet.