AutoPackAI/Auto-GPT-Benchmarks

Language: Python. License: MIT.

Auto-GPT Benchmark

A repo for benchmarking the performance of agents, regardless of how they are set up or how they work.

Scores:

Agent scores will go here, both overall and by category.

Integrated Agents

Auto-GPT
gpt-engineer
mini-agi
smol-developer