/llmjudge

Exploring limitations of LLM-as-a-judge

Primary LanguageJupyter Notebook

Watchers