/GAIA

Beating the GAIA benchmark with Transformers Agents. 🚀

Primary LanguageJupyter NotebookApache License 2.0Apache-2.0

Beating GAIA with Transformers Agents 🚀

This is the exact code used for our submission that scores #2 on the test set, #1 on the validation set.

GAIA leaderboard screenshot

Check out the current leaderboard here.

How to run tests?

First, install requirements:

pip install -r requirements.txt

Setup your secrets in a .envfile:

HUGGINGFACEHUB_API_TOKEN
SERPAPI_API_KEY
OPENAI_API_KEY
ANTHROPIC_API_KEY

And optionally if you want to use Anthropic models via AWS bedrock:

AWS_BEDROCK_ID
AWS_BEDROCK_KEY

Then run gaia.py to launch tests!