- NeurIPS 2023 Workshop on Instruction Following
- Research Paper: arXiv
- Project Page: To be uploaded
This project introduces a novel approach to enhancing language model performance by enabling past experience and memory management.
- Installing the requirments
python -m pip install -r requirments
- Setting the OPENAI_API
echo "OPENAI_API_KEY=<YOUR_API_KEY>" >> .env
- Run the agent
python -m agent.self_doc_agent
- Legacy Code: Original implementation available at GitHub.
- New Feature: Memory operations (add, revise, none) have been incorporated to allow the model to update its experiences and organize its memory effectively.
-
HumanEval Benchmark:
Method % Pass@1 GPT-4 67.00 GPT-4 + † 90.85 Reflexion [17] 91.00 GPT-4 + † + PaG 92.68 † denotes the addition of a code interpreter capability.
-
DS-1000 Benchmark (Not in Paper):
Condition pd(origin) pd(difficult) All Memory No Memory 0.45 0.32 - Memory Add Only 0.46 0.34 179/179 PaG (Origin First) 0.5 0.42 111/179 PaG (Difficult First) 0.34 0.46 114/179 Note: The self-updating memory emphasizes learning from easier to more difficult problems to update insightful memory.
...