Does Refusal Training in LLMs Generalize to the Past Tense? [arXiv, July 2024]
Primary Language: Python