Improbable-AI/curiosity_redteam
Official implementation of the ICLR'24 paper "Curiosity-driven Red Teaming for Large Language Models" (https://openreview.net/pdf?id=4KqkizXgXU)
Jupyter Notebook · MIT license
Stargazers
- 2019ChenGong (University of Virginia)
- Alan-Qin (HKUST)
- CCCS-Omar
- denisfitz57
- dmksjfl
- emigmo (Tsinghua University)
- ERnest666 (Champaign, IL)
- feizhihui (Tencent)
- fffffarmer (SJTU)
- Huan80805 (National Taiwan University EE)
- icemoon-creative
- ioo0s (Li Auto)
- jyhong836 (University of Texas at Austin)
- kingzcx (ucas_zcx)
- Liesy (Institute of Computing Technology, Chinese Academy of Sciences)
- mofanv (Imperial College London)
- N3mes1s (https://github.com/ReaQta)
- Neko9810
- NicerWang
- paraGONG
- rain305f (Peking University)
- ruizheng20 (Fudan University)
- seshurajup (@dolcera)
- sunlylorn
- TimeLovercc (Penn State University)
- tokarev-i-v
- voidism (CSAIL, MIT)
- wittycheng (University of Science and Technology of China)
- xhwang22
- yechao-zhang (Huazhong University of Science and Technology)
- yiren-liu
- YitingQu (Saarland, Germany)
- ylyinzju (Zhejiang University)
- Yuancheng-Xu (University of Maryland, College Park)
- zch42
- zzxxxl