/T-Eval

T-Eval: Evaluating Tool Utilization Capability of Large Language Models Step by Step

Primary LanguagePythonApache License 2.0Apache-2.0

Stargazers