/T-Eval

[ACL2024] T-Eval: Evaluating Tool Utilization Capability of Large Language Models Step by Step

Primary LanguagePythonApache License 2.0Apache-2.0

Watchers