h2oai/h2o-LLM-eval
Large language model evaluation framework with an Elo leaderboard and A/B testing
Jupyter Notebook · Apache-2.0
Issues
- 0
[BUG] Double click on one of the rows in the "Elo Leaderboard" blocks the app
#13 opened by pascal-pfeiffer
- 0
Do the A&B test support multi-turn conversation
#12 opened by vackosar
- 2
Sorry for the Inconvenience. Please refresh your browser to restart H2O LLM Eval.
#9 opened by cfregly
- 1
- 0
Batch insert with psycopg3
#4 opened by sAbhay