aymeric-roucher/agent_reasoning_benchmark
🔧 Compare how Agent systems perform on several benchmarks. 📊🚀
Jupyter Notebook · Apache-2.0
Stargazers
- alt-glitch (Julep AI)
- araujof (IBM Research)
- BaHuy15
- bobbyhchrist
- catyung (Super Cat Technology Limited)
- CoolOppo (Pittsburgh, PA)
- DarrellYoung
- DhruvJari07 (Beam Data)
- dorucioclea (@CoreBuildSoftware)
- edisonjoao1
- fgbelidji (Hugging Face)
- fprivitera (Italy)
- gulshansainis (India)
- harshagv999
- herryangsong
- kunato (KUNANA AI)
- laleph
- mail4y
- Minthos
- mrm1001
- ollmer (ServiceNow Research)
- plaggy
- RadchaneepornC
- renan-kazazoglu (@en-ko)
- rmanoka
- smuotoe
- songkq
- svjack
- taoshen58 (Sydney, Australia)
- THEFIG06 (Customdata.io)
- tigicion
- tokestermw (Cresta)
- tombo419
- trotsky1997 (Alibaba)
- zhangzhiqiangccm (CUC)
- zhoujz10