EleutherAI/delphi

[Experiments] - Score explanations generated by COT and Simple explanations in GPT2

SrGonao opened this issue · 1 comments

  • Select the same features (100?) from all layers.
  • Run the simple explainer and the COT explainer using the same number of prompts and the same sentences.
  • Score the results (recal or rubric?)

Currently running the scoring using recal