haoranD/opencompass
OpenCompass is an LLM evaluation platform, supporting evaluation of HuggingFace, API and custom models (LLaMA, ChatGPT, Claude, etc) over 50+ datasets. It enables fast, comprehensive benchmarking of large models using efficient distributed evaluation techniques.
PythonApache-2.0
Watchers
No one’s watching this repository yet.