Clarification on ReflexionStrategy Implementation

Question

Clarification on ReflexionStrategy Implementation

zapqqqwe opened this issue a year ago · 0 comments

I hope this message finds you well. I am currently exploring the ReflexionStrategy in your project located at https://github.com/noahshinn024/reflexion in the hotpotqa_runs/notebook section.
Specifically, I am interested in understanding the behavior of the strategy when it comes to the "LAST_ATTEMPT_AND_REFLEXION" mode.
According to the code at print(ReflexionStrategy.doc), the description for "LAST_ATTEMPT_AND_REFLEXION" is as follows:
LAST_ATTEMPT_AND_REFLEXION: Use last reasoning trace in context and apply reflexion to the next reasoning trace. However, upon examining the implementation in agent.py at line 116, it appears that the strategy does not combine the results of reflexion for each iteration but only considers the last reflexion results of the previous trace.
Could you kindly clarify if this is indeed the intended behavior, or if there may be a misunderstanding in the description? It seems to me that the strategy should be named "LAST_ATTEMPT_AND_LAST_REFLEXION" if it only takes the last reflexion results of the previous trace into account.
I appreciate your assistance in clarifying this aspect of the ReflexionStrategy implementation. Thank you for your time and support.