/LLM-LieDetector

Code for the paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions"

Primary LanguageJupyter NotebookBSD 3-Clause "New" or "Revised" LicenseBSD-3-Clause

Watchers