microsoft/eureka-ml-insights
A framework for standardizing evaluations of large foundation models, beyond single-score reporting and rankings.
PythonApache-2.0
A framework for standardizing evaluations of large foundation models, beyond single-score reporting and rankings.
PythonApache-2.0