[ACL'24] A Knowledge-grounded Interactive Evaluation Framework for Large Language Models
Primary LanguagePython