/honest_llama

Inference-Time Intervention: Eliciting Truthful Answers from a Language Model

Primary LanguageJupyter NotebookMIT LicenseMIT

No issues in this repository yet.