/opencompass

OpenCompass is an LLM evaluation platform that supports 20+ models and 50+ datasets, enabling fast, comprehensive benchmarking of large models through efficient distributed evaluation.

Primary language: Python. License: Apache License 2.0 (Apache-2.0).
