OpenGenerativeAI/llm-colosseum
Benchmark LLMs by fighting in Street Fighter 3! The new way to evaluate the quality of an LLM
Jupyter Notebook · MIT
Issues
- 1
🏟️ (9ef4) Error: Wrong rom file for sfiii3n:
#56 opened by sevaroy - 11
Fighters not fighting
#60 opened by nickschuetz - 1
can't fetch new commit
#54 opened by taozhiyuai - 1
[question] Two characters cannot approach each other after they switch positions.
#46 opened by shawokou123 - 3
Project no more maintained?
#57 opened by Greatz08 - 1
- 4
- 8
can not run local model
#50 opened by mengxiyou - 1
ERROR after upgrade new docker image
#52 opened by taozhiyuai - 4
Issue while trying to run locally
#49 opened by edgmin - 3
ELO ranking score?
#47 opened by Tokkiu - 1
Hello, brother. How to modify the program so that AI can play computer controlled characters?
#41 opened by shawokou123 - 0
Add Google Gemini model
#43 opened by shawokou123 - 0
How to use Google gemini model
#42 opened by shawokou123 - 3
Is there a way to set it to do best 3 of 5?
#40 opened by lafintiger - 1
Yi 6b , no action
#39 opened by taozhiyuai - 29
can I run locally?
#28 opened by taozhiyuai - 3
- 0
how to set show_final=true???
#38 opened by taozhiyuai - 0
suggestion: add blood in log
#37 opened by taozhiyuai - 5
Report Different models fight on the street
#35 opened by taozhiyuai - 4
how to change characters?
#34 opened by taozhiyuai - 4
- 1
how it learn from history?
#31 opened by taozhiyuai - 5
larger model, worse performance?
#30 opened by WSPeng - 1
fight in loop
#32 opened by taozhiyuai - 4
.diambra/roms does not exist.
#27 opened by taozhiyuai - 3
Unset API keys are not handled properly
#29 opened by moritz-august - 2
MacOS only?
#24 opened by Datou