OpenGenerativeAI/llm-colosseum
Benchmark LLMs by fighting in Street Fighter 3! The new way to evaluate the quality of an LLM
Jupyter NotebookMIT
Stargazers
- alloyappleChina
- ArvinKing77US
- blackSp0nge
- cryoncryptoModular Labs Srl
- danielhv10Spain
- digitalappliedDigital Applied
- evanshortiss@neondatabase
- fonix
- fritolhere
- gm8xx8
- hdvrai
- idgmatrixGenAI Korea
- Madankh
- navneetdesaiRochester, New York
- oulianovphospho
- Pierre-LouisBJT@phospho-app
- plasmanunchucks
- Platinn@phospho-app
- Pseudopode
- raonigabrielCuritiba - Paraná, Brazil
- raoufchebriNeon
- sachaarbonel@GetStream
- samchaaatraveling
- shuxiaokaiM78 Nebula
- SidUMicrosoft
- sroeckerRed Hat
- StanGirard@QuivrHQ
- teristam
- TheRedOperator
- TheSouthFrogCUHK
- UglyStupidHonestHaarlem Netherlands
- VirtualFlowIncVirtualFlow Inc
- vpvpvpvpNeuralnetic
- websitegardener
- Zewed
- zwhe99Shanghai Jiao Tong University