lapisrocks/rpo
Official repository for "Robust Prompt Optimization for Defending Language Models Against Jailbreaking Attacks"
Python
Stargazers
- allweonedev
- andyz2045
- andyz245
- Aniloid2 (A*Star)
- AppleXY (CISPA Helmholtz Center for Information Security)
- chrisyxue (UESTC)
- egozverev (ISTA)
- emigmo (Tsinghua University)
- ericrallen (@DVDAGames)
- fahadshamshad (MBZUAI)
- glencoe2004
- grasses (PhD@ZJU)
- gszfwsb (Shanghai Jiao Tong University)
- hiepbkhn (BKHN)
- HLiang-Lee (BJTU, Microsoft STCA)
- hshijiiii
- IssacRunmin (Zhejiang University)
- jiaxiaojunQAQ (Nanyang Technological University)
- jyhong836 (University of Texas at Austin)
- Kkkaystone
- LeeOrange-is-me
- LetheSec (University of Science and Technology of China)
- LiangSiyuan21
- lijinfeng0713
- Lordog (Shanghai Jiao Tong University)
- meet-cjli
- mihai-gheorghe (@flowx-ai)
- OverAny
- SCccc21 (Virginia Tech)
- SolidShen (West Lafayette)
- tanjeffreyz (University of California, Berkeley)
- THUYimingLi (Zhejiang University)
- xszheng2020
- yechao-zhang (Huazhong University of Science and Technology)
- yesdtrx
- zmackie