itsmemala/honest_llama
Inference-Time Intervention: Eliciting Truthful Answers from a Language Model
Jupyter NotebookMIT
No issues in this repository yet.
Inference-Time Intervention: Eliciting Truthful Answers from a Language Model
Jupyter NotebookMIT
No issues in this repository yet.