THUNLP-MT/StableToolBench

A new tool learning benchmark aiming at well-balanced stability and reality, based on ToolBench.

PythonApache-2.0

Issues

Is the server still working properly?
#25 opened 2 months ago by Octobrist
0
ToolBench Key
#24 opened 3 months ago by hoyeongchoi
0
requests.exceptions.ConnectionError: HTTPConnectionPool(host='8.218.239.54', port=8080
#8 opened 10 months ago by lileishitou
1
Looking for a ToolBench key
#23 opened 4 months ago by Dlxxx
0
inference problem
#15 opened 4 months ago by farawayxxx
0
ToolBench Key
#21 opened 6 months ago by Dandelionym
2
ToolBench Key
#22 opened 5 months ago by Hanlin1004
0
role error
#16 opened 5 months ago by farawayxxx
1
Did ToolBench server crashed?
#18 opened 6 months ago by Reason-Wang
1
DFS.py changes causing functions to not be called with ToolLLaMa
#19 opened 6 months ago by kingb12
1
Could you release the reproduction data for your result
#5 opened 10 months ago by p1nksnow
2
报错信息：AttributeError: function_call`
#17 opened 7 months ago by wupaopao123
0
gpt3.5 > gpt4 on pass rate?
#14 opened 8 months ago by stanpcf
1
applied for toolbench_key but no one responded
#13 opened 8 months ago by stanpcf
2
Implementation of DFSDT and ReACT
#11 opened 9 months ago by JuhaoLiang1997
1
How is native LLM on this benchmark?
#12 opened 8 months ago by YenFuLin
1
Correctness of API Simulator
#4 opened 9 months ago by xuanz20
1
Encounter with 500 Internal Server Error when requesting toolbench_url
#9 opened 9 months ago by Dr-Left
2
Reproduce experimental results.
#7 opened 10 months ago by Taeyoung-Jang
1
tool_root_dir in the inference script
#6 opened 10 months ago by zhiyuanc2001
2
Do we need RapidAPI if run the server using Docker?
#3 opened 10 months ago by xuanz20
2
Plans to add a license?
#2 opened a year ago by timisstrong
2