LLM-RGB

LLM Reasoning and Generation Benchmark. Evaluate LLMs in complex scenarios systematically.

Primary Language: TypeScript
License: MIT
