agential-ai/agential

[Feature Request]: Reflexion

Opened this issue 3 months ago · 0 comments

alckasoc commented 3 months ago

Feature Description

Implement:

HotpotQA
#123
#122
#124
#125
#126
#127
#128
#129
#131
#132
#133 (includes ALFWorld & WebShop)

The decision-making benchmarks (ALFWorld, WebShop, and AgentBench) will require more design work than the others; swapping out the prompts alone won't suffice.
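To illustrate why prompts alone may not be enough here, below is a minimal sketch of a Reflexion-style trial loop over an interactive environment. Everything in it is a hypothetical stand-in (`Environment`, `DoorEnv`, `stub_policy`, `stub_reflect`, `reflexion_loop`); none of these names come from the agential codebase, and the policy and reflection functions are hard-coded stubs where an LLM call would normally go. The point is only that a decision-making benchmark needs a `reset`/`step` interface and a multi-step episode, which a QA-style prompt swap does not provide.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional, Protocol, Tuple


class Environment(Protocol):
    """Hypothetical interface an interactive benchmark would need to expose."""

    def reset(self) -> str: ...
    def step(self, action: str) -> Tuple[str, float, bool]: ...


@dataclass
class DoorEnv:
    """Toy environment: the door only opens if the key was taken first."""

    has_key: bool = False

    def reset(self) -> str:
        self.has_key = False
        return "You are in front of a locked door. A key is on the table."

    def step(self, action: str) -> Tuple[str, float, bool]:
        # Returns (observation, reward, done).
        if action == "take key":
            self.has_key = True
            return "You hold the key.", 0.0, False
        if action == "open door":
            if self.has_key:
                return "The door opens.", 1.0, True
            return "The door is locked.", 0.0, True
        return "Nothing happens.", 0.0, False


def stub_policy(observation: str, reflections: List[str]) -> str:
    """Stand-in for an LLM actor; it only succeeds once a reflection mentions the key."""
    if "hold the key" in observation:
        return "open door"
    if any("key" in r for r in reflections):
        return "take key"
    return "open door"


def stub_reflect(trial: int) -> str:
    """Stand-in for an LLM self-reflection step after a failed trial."""
    return "I should take the key before opening the door."


def run_episode(env: Environment, policy: Callable[[str, List[str]], str],
                reflections: List[str], max_steps: int = 5) -> bool:
    """Run one episode; return True if it ended with a positive reward."""
    observation = env.reset()
    for _ in range(max_steps):
        action = policy(observation, reflections)
        observation, reward, done = env.step(action)
        if done:
            return reward > 0
    return False


def reflexion_loop(env: Environment, policy: Callable[[str, List[str]], str],
                   reflect: Callable[[int], str],
                   max_trials: int = 3) -> Tuple[Optional[int], List[str]]:
    """Retry failed episodes, appending one reflection after each failure."""
    reflections: List[str] = []
    for trial in range(1, max_trials + 1):
        if run_episode(env, policy, reflections):
            return trial, reflections
        reflections.append(reflect(trial))
    return None, reflections


if __name__ == "__main__":
    solved_on, memory = reflexion_loop(DoorEnv(), stub_policy, stub_reflect)
    print(solved_on, memory)
```

In this toy setup the first trial fails, one reflection is stored, and the second trial succeeds. A real ALFWorld or WebShop integration would additionally need observation formatting, action parsing, and per-benchmark success criteria, which is where the extra design work would likely go.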

Run:

HotpotQA
TriviaQA
AmbigNQ
GSM8k
SVAMP
TabMWP
MBPP
HumanEval
ALFWorld
WebShop
AgentBench (includes ALFWorld & WebShop)
