Skip to content

Latest commit

 

History

History
35 lines (25 loc) · 996 Bytes

File metadata and controls

35 lines (25 loc) · 996 Bytes

Beating GAIA with Transformers Agents 🚀

This is the exact code used for our submission that scores #2 on the test set, #1 on the validation set.

GAIA leaderboard screenshot

Check out the current leaderboard here.

How to run tests?

First, install requirements:

pip install -r requirements.txt

Setup your secrets in a .envfile:

HUGGINGFACEHUB_API_TOKEN
SERPAPI_API_KEY
OPENAI_API_KEY
ANTHROPIC_API_KEY

And optionally if you want to use Anthropic models via AWS bedrock:

AWS_BEDROCK_ID
AWS_BEDROCK_KEY

Then run gaia_multiagent.py to launch tests!

Head to smolagents for a better version!

We've mnaged to increase score to 55% using the new framework smolagents! Head to this folder in smolagents to find a more recent run on GAIA, with better scores than this one!