A repo built for the purpose of benchmarking the performance of agents far and wide, regardless of how they are set up and how they work
Click here to see the results and the raw data!!
More agents coming soon !
A repo built for the purpose of benchmarking the performance of agents, regardless of how they are set up and how they work.
Jupyter NotebookMIT
A repo built for the purpose of benchmarking the performance of agents far and wide, regardless of how they are set up and how they work
Click here to see the results and the raw data!!
More agents coming soon !