ShiJiawenwen

Pinned Repositories

openchat
OpenChat: Advancing Open-source Language Models with Imperfect Data
Language:Python5.2k 49 187399
llm-attacks
Universal and Transferable Attacks on Aligned Language Models
Language:Python3.3k 34 94464
JailBreak-Large-Language-Model-With-A-Malicous-System-Role
We present a novel method that can jailbreak large language model with a malicous system role. It releases the potentially unethical or illegal intention of leveraging a large language model, like ChatGPT, to breach the security measures put in place to limit its access and permissions within a controlled environment.
Language:Python1 0 00
papers
paper学习笔记
0 1 00
PoisonedRAG
[USENIX Security 2025] PoisonedRAG: Knowledge Corruption Attacks to Retrieval-Augmented Generation of Large Language Models
Language:Python70 2 78

ShiJiawenwen's Repositories

ShiJiawenwen/JailBreak-Large-Language-Model-With-A-Malicous-System-Role
We present a novel method that can jailbreak large language model with a malicous system role. It releases the potentially unethical or illegal intention of leveraging a large language model, like ChatGPT, to breach the security measures put in place to limit its access and permissions within a controlled environment.
Language:Python1 0 00
ShiJiawenwen/papers
paper学习笔记
0 1 00