/opencompass

OpenCompass is an LLM evaluation platform, supporting evaluation of HuggingFace, API and custom models (LLaMA, ChatGPT, Claude, etc) over 50+ datasets. It enables fast, comprehensive benchmarking of large models using efficient distributed evaluation techniques.

Primary LanguagePythonApache License 2.0Apache-2.0

Watchers

No one’s watching this repository yet.