Actor in WebShop does not take in the memory and reflexion

Question

yananchen1989 opened this issue 10 months ago · 7 comments

yananchen1989 commented 10 months ago

Hi there,

I am a bit confused about how Reflexion is applied to WebShop.
In the code here, at line 245,
https://github.com/noahshinn/reflexion/blob/main/webshop_runs/webshop_trial.py
the LLM actor only takes in base_prompt + prompt, i.e. the trajectory at the current step, which is the left yellow block in the figure.

However, it seems that the LLM actor does not take in env_history, which is the right yellow block in the figure and contains the memory and reflexion from previous trials (if trial > 1).
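To make the concern concrete, here is a minimal sketch of the situation described above. It is not the code from webshop_trial.py: `build_actor_prompt`, the example strings, and the `memory` list are hypothetical stand-ins, used only to show that a prompt built from base_prompt + prompt never contains the stored reflections.

```python
# Hypothetical stand-ins; names and strings are illustrative,
# not copied from webshop_trial.py.
def build_actor_prompt(base_prompt: str, trajectory: str) -> str:
    """Prompt as described above: few-shot prefix plus current trajectory."""
    return base_prompt + trajectory


base_prompt = "Webshop\nInstruction: find a red shirt\n"
trajectory = "Observation: search results page\n\nAction:"

# Reflections carried over from earlier trials (assumed to live elsewhere,
# e.g. inside an env_history object).
memory = ["In the last trial I clicked the wrong size option."]

actor_prompt = build_actor_prompt(base_prompt, trajectory)

# Check which reflections the actor can actually see.
reflections_seen = [m for m in memory if m in actor_prompt]
print(reflections_seen)  # prints []
```

If the real actor call is built the same way, the reflections would only influence the run if they were folded into one of those two strings beforehand.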

Am I missing something?

If the LLM actor really does not take in the memory, could that explain why there is no gain on the WebShop task, as you reported in the paper?
Thanks.

Answer 1 · 2024-02-02T23:35:53.000Z

I forked your repo and made changes here: main...yananchen1989:reflexion:yc#diff-36a02556b49e22008fa36a519bf0cde61f8343559dfde60a3c229fb72176d00fR304

I am not sure this is how it should be done. Please advise.

Answer 2 · 2024-02-05T21:42:07.000Z

My test results, FYI.

Answer 3 · 2024-04-13T04:27:56.000Z

@yananchen1989 Can I ask which columns belong to your changes? With reflex?

Answer 4 · 2024-04-13T04:36:20.000Z

> @yananchen1989 Can I ask which columns belong to your changes? With reflex?

@theblackcat102 Hello, the change is on line 304:
action = llm_chat(str(env_history) + "\n\nAction:", stop=['\n']).strip().lstrip(' ') # fix the reflexion
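The one-line change above can be exercised in isolation with a small sketch. Everything here is an assumption for illustration: `EnvHistory` is a hypothetical stand-in for the repo's env_history object (its real `__str__` layout may differ), and `llm_chat` is a stub that records the prompt instead of calling a model. The point is only to show that passing `str(env_history)` puts the stored reflections in front of the actor.

```python
from typing import List


class EnvHistory:
    """Hypothetical stand-in for the repo's env_history object."""

    def __init__(self, base_prompt: str, memory: List[str]) -> None:
        self.base_prompt = base_prompt
        self.memory = memory          # reflections from previous trials
        self.steps: List[str] = []    # observations/actions of this trial

    def add(self, label: str, value: str) -> None:
        self.steps.append(f"{label}: {value}")

    def __str__(self) -> str:
        # Render few-shot prefix, then memory, then the current trajectory.
        parts = [self.base_prompt]
        if self.memory:
            parts.append("Your memory for the task below:")
            parts.extend(f"Trial {i}:\n{m}" for i, m in enumerate(self.memory))
        parts.extend(self.steps)
        return "\n".join(parts)


seen_prompts: List[str] = []


def llm_chat(prompt: str, stop: List[str]) -> str:
    """Stub model: records the prompt and returns a fixed action."""
    seen_prompts.append(prompt)
    return " search[red shirt]\n"


env_history = EnvHistory(
    "Webshop few-shot examples",
    ["I picked the wrong size last time."],
)
env_history.add("Observation", "search page")

# Same call shape as the change above, with the stub in place of the model.
action = llm_chat(str(env_history) + "\n\nAction:", stop=["\n"]).strip()
```

With this setup, `seen_prompts[0]` contains the reflection text, which is the behavior the change is meant to restore. Whether this matches the authors' intended prompt format is a separate question.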

Answer 5 · 2024-04-13T04:43:05.000Z

I guess this could be a bug in the original code, one that leads to the wrong conclusion regarding WebShop.
Correct me if I am missing something.

Answer 6 · 2024-09-04T02:25:33.000Z

> My test results, FYI.

@yananchen1989 Hi, may I know which model this result comes from?

Answer 7 · 2024-09-04T04:36:27.000Z

> My test results, FYI.

Wow! I missed this result, but thank you for finding this issue!