jujipotle/honest_llama
Inference-Time Intervention: Eliciting Truthful Answers from a Language Model
PythonMIT
Stargazers
No one’s star this repository yet.
Inference-Time Intervention: Eliciting Truthful Answers from a Language Model
PythonMIT
No one’s star this repository yet.