/h2o-LLM-eval

Large-language Model Evaluation framework with Elo Leaderboard and A-B testing

Primary LanguageJupyter NotebookApache License 2.0Apache-2.0