sisl/ASTPrompter
Weakly Supervised Automated Language Model Red-Teaming to Identify Likely Toxic Prompts.
Python
No issues in this repository yet.