sail-sg/Cheating-LLM-Benchmarks
[SafeGenAi @ NeurIPS 2024] Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates
Jupyter NotebookMIT
Issues
- 1
Quick question about your prompt
#1 opened
[SafeGenAi @ NeurIPS 2024] Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates
Jupyter NotebookMIT