/llm-summary-evals

Comparing the performance of GPT-4 and Claude 3 Opus on a summarization task

Primary LanguageJupyter Notebook

Stargazers