DialogBench: Evaluating LLMs as Human-like Dialogue Systems
Primary Language: Python
This repository is not active