RomanEngeler1805/honest_llama
Inference-Time Intervention: Eliciting Truthful Answers from a Language Model
Python · MIT