/ASTPrompter

Weakly Supervised Automated Language Model Red-Teaming to Identify Likely Toxic Prompts.

Primary LanguagePython

No issues in this repository yet.