agential-ai/agential
🔔🧠Easily experiment with popular language agents across diverse reasoning/decision-making benchmarks!
Python · MIT
Issues
[Feature Request]: Evaluation Metrics
#264 opened
[Feature Request]: Evaluation Harness
#262 opened
[Feature Request]: Standardize Base Agent
#254 opened
[Feature Request]: Add simple baselines
#247 opened
[Feature Request]: Implement LATS
#245 opened
[Feature Request]: ExpeL Structured Outputs
#243 opened
[Feature Request]: ExpeL
#242 opened
[Feature Request]: MBPP for ExpeL
#241 opened
[Feature Request]: HumanEval for ExpeL
#240 opened
[Feature Request]: TabMWP for ExpeL
#239 opened
[Feature Request]: SVAMP for ExpeL
#238 opened
[Feature Request]: GSM8K for ExpeL
#237 opened
[Feature Request]: FEVER for ExpeL
#236 opened
[Feature Request]: TriviaQA for ExpeL
#235 opened
[Feature Request]: AmbigNQ for ExpeL
#234 opened
[Feature Request]: HotpotQA for ExpeL
#233 opened
[Feature Request]: Refactor ExpeL
#232 opened
[Feature Request]: Refactor Self-Refine
#226 opened