itsmemala/honest_llama
Inference-Time Intervention: Eliciting Truthful Answers from a Language Model
Jupyter Notebook · MIT
Stargazers
No one has starred this repository yet.