haoranD/opencompass

OpenCompass is an LLM evaluation platform that supports HuggingFace, API-based, and custom models (LLaMA, ChatGPT, Claude, etc.) across 50+ datasets. It enables fast, comprehensive benchmarking of large models through efficient distributed evaluation.

Python · Apache-2.0
