holarissun/Prompt-OIRL
Code for the paper "Query-Dependent Prompt Evaluation and Optimization with Offline Inverse Reinforcement Learning"
Python · MIT
Issues
dataset_gsm8k
#3 opened by zhaihaixu - 1
The function of held-out prompts
#2 opened by Lancelot1998