LoryPack/LLM-LieDetector
Code for the ICLR 2024 paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions"
Jupyter Notebook · BSD-3-Clause
Stargazers
- abacaj (software eng building things)
- aflah02 (Indraprastha Institute of Information Technology Delhi)
- AISafety-HKUST (HKUST)
- Alan-Qin (HKUST)
- AlexTMallen (Seattle, WA)
- AndromedaPerseus (United States)
- asapsav (San Francisco)
- ayyyq
- denisfitz57
- firstuserhere
- fly51fly (PRIS)
- freddiev4 (Quotient AI)
- gm8xx8
- gsarti (University of Groningen)
- HakeemDemi (London UK)
- IndieMinimalist
- jabogithub
- jaebooker
- jeffara (Thrivus)
- JeremyAlain (New York University)
- jplasser (Linz, Austria)
- kgourgou
- L0Z1K (@corca-ai)
- LoryPack (University of Cambridge)
- marcgreen
- mattbit (@Giskard-AI)
- NISH1001 (@NASA-IMPACT)
- norabelrose (EleutherAI)
- othmaneabou (@DataDog)
- Perseus101
- SidU (Microsoft)
- vishaal27 (University of Tübingen | University of Cambridge)
- Yueeeeeeee (UIUC)
- yulonghui (Peking University)
- yuta0821
- Zumus (Los Angeles)