Pinned Repositories
wmdp
WMDP is a LLM proxy benchmark for hazardous knowledge in bio, cyber, and chemical security. We also release code for RMU, an unlearning method which reduces LLM performance on WMDP while retaining general capabilities.
carving
Package to optimize Adversarial Attacks against (Large) Language Models with Varied Objectives
JailbreakingLLMs
7000
AIWorkflow-showmode
CG
clusternet
Managing your Kubernetes clusters (including public, private, edge, etc) as easily as visiting the Internet ⎈
ddcs-scripts
2021-AIWorkflow
IBM AI Workflow Project
JailTrickBench
Bag of Tricks: Benchmarking of Jailbreak Attacks on LLMs. Empirical tricks for LLM Jailbreaking. (NeurIPS 2024)
SLZ0106's Repositories
SLZ0106/7000
SLZ0106/AIWorkflow-showmode
SLZ0106/CG
SLZ0106/clusternet
Managing your Kubernetes clusters (including public, private, edge, etc) as easily as visiting the Internet ⎈
SLZ0106/ddcs-scripts