/N8Bench

30 questions to tell the smart LLMs from the dumb.

Primary Language: JavaScript

A benchmark of 30 questions for gauging LLM intelligence. It discriminates well between models of differing ability:

leaderboard

It costs only 20 cents to run.
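
For a rough sense of scale, the two figures quoted here (30 questions, 20 cents) can be turned into an average cost per question. This is only a sketch: it assumes the 20 cents covers a single pass over all 30 questions, and `costPerQuestionCents` is a hypothetical helper written for illustration, not something taken from the repository.

```javascript
// Figures taken from the description above.
const QUESTION_COUNT = 30;
const TOTAL_COST_CENTS = 20;

// Hypothetical helper: average cost of one question, in cents.
// Assumes the total covers exactly one pass over every question.
function costPerQuestionCents(totalCents, questionCount) {
  if (questionCount <= 0) {
    throw new RangeError("questionCount must be positive");
  }
  return totalCents / questionCount;
}

const perQuestion = costPerQuestionCents(TOTAL_COST_CENTS, QUESTION_COUNT);
console.log(`${perQuestion.toFixed(2)} cents per question`); // prints "0.67 cents per question"
```

Under that assumption each question averages well under one cent, so rerunning the whole set after a change stays cheap.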