llm-security

There are 70 repositories under llm-security topic.

pathwaycom/llm-app
Ready-to-run cloud templates for RAG, AI pipelines, and enterprise search with live data. 🐳Docker-friendly.⚡Always in sync with Sharepoint, Google Drive, S3, Kafka, PostgreSQL, real-time data APIs, and more.
8.6k 31 16271
Giskard-AI/giskard
🐢 Open-Source Evaluation & Testing for AI & LLM systems
Language:Python4.2k 33 463278
NVIDIA/garak
the LLM vulnerability scanner
Language:Python3.1k 31 602267
verazuo/jailbreak_llms
[CCS'24] A dataset consists of 15,140 ChatGPT prompts from Reddit, Discord, websites, and open-source datasets (including 1,405 jailbreak prompts).
Language:Jupyter Notebook2.8k 36 8257
protectai/llm-guard
The Security Toolkit for LLM Interactions
Language:Python1.3k 19 68169
msoedov/agentic_security
Agentic LLM Vulnerability Scanner / AI red teaming kit
Language:Python869 12 990
mariocandela/beelzebub
A secure low code honeypot framework, leveraging AI for System Virtualization.
Language:Go710 12 955
EasyJailbreak/EasyJailbreak
An easy-to-use Python framework to generate adversarial jailbreak prompts.
Language:Python516 8 3041
chawins/llm-sp
Papers and resources related to the security and privacy of LLMs 🤖
Language:Python460 17 834
deadbits/vigil-llm
⚡ Vigil ⚡ Detect prompt injections, jailbreaks, and other potentially risky Large Language Model (LLM) inputs
Language:Python331 11 5236
R3DRUN3/sploitcraft
🏴‍☠️ Hacking Guides, Demos and Proof-of-Concepts 🥷
Language:Jupyter Notebook185 4 026
liu00222/Open-Prompt-Injection
This repository provides implementation to formalize and benchmark Prompt Injection attacks and defenses
Language:Python160 2 620
phantasmlabs/phantasm
Toolkits to create a human-in-the-loop approval layer to monitor and guide AI agents workflow in real-time.
Language:Svelte1527
yevh/TaaC-AI
AI-driven Threat modeling-as-a-Code (TaaC-AI)
Language:HTML120 5 313
ZenGuard-AI/fast-llm-security-guardrails
The fastest && easiest LLM security guardrails for AI Agents and applications.
Language:Python110 2 313
arekusandr/last_layer
Ultra-fast, low latency LLM prompt injection/jailbreak detection ⛓️
Language:Python109 2 33
raga-ai-hub/raga-llm-hub
Framework for LLM evaluation, guardrails and security
Language:Python100 2 310
lakeraai/pint-benchmark
A benchmark for prompt injection detection systems.
Language:Jupyter Notebook94 5 310
pdparchitect/llm-hacking-database
This repository contains various attack against Large Language Models.
83 4 04
llm-platform-security/SecGPT
SecGPT: An execution isolation architecture for LLM-based systems
Language:Python53 3 17
microsoft/BIPIA
A benchmark for evaluating the robustness of LLMs and defenses to indirect prompt injection attacks.
Language:Python53 6 65
NaniDAO/ie
intents engine
Language:Solidity53 6 06
azminewasi/Awesome-LLMs-ICLR-24
It is a comprehensive resource hub compiling all LLM papers accepted at the International Conference on Learning Representations (ICLR) in 2024.
43 1 01
briland/LLM-security-and-privacy
LLM security and privacy
Language:TeX43 2 07
RomiconEZ/llamator
Framework for testing vulnerabilities of large language models (LLM).
Language:Python372
sinanw/llm-security-prompt-injection
This project investigates the security of large language models by performing binary classification of a set of input prompts to discover malicious prompts. Several approaches have been analyzed using classical ML algorithms, a trained LLM model, and a fine-tuned LLM model.
Language:Jupyter Notebook35 3 27
SEC-CAFE/handbook
安全手册，企业安全实践、攻防与安全研究知识库
Language:CSS34 1 04
leondz/lm_risk_cards
Risks and targets for assessing LLMs & LLM vulnerabilities
Language:Python30 6 08
LostOxygen/llm-confidentiality
Whispers in the Machine: Confidentiality in LLM-integrated Systems
Language:Python30 2 14
llm-platform-security/chatgpt-plugin-eval
LLM Platform Security: Applying a Systematic Evaluation Framework to OpenAI's ChatGPT Plugins
Language:HTML25 2 17
TrustAI-laboratory/Learn-Prompt-Hacking
This is The most comprehensive prompt hacking course available, which record our progress on a prompt engineering and prompt hacking course.
Language:Jupyter Notebook24 1 00
google/litmus
Litmus is a comprehensive LLM testing and evaluation tool designed for GenAI Application Development. It provides a robust platform with a user-friendly UI for streamlining the process of building and assessing the performance of your LLM-powered applications.
Language:Vue21 3 13
dapurv5/awesome-red-teaming-llms
Repository accompanying the paper https://arxiv.org/abs/2407.14937
18 4 01
lakeraai/chainguard
Guard your LangChain applications against prompt injection with Lakera ChainGuard.
Language:Python18 6 02
levitation-opensource/Manipulative-Expression-Recognition
MER is a software that identifies and highlights manipulative communication in text from human conversations and AI-generated responses. MER benchmarks language models for manipulative expressions, fostering development of transparency and safety in AI. It also supports manipulation victims by detecting manipulative patterns in human communication.
Language:HTML13 4 03
jiangnanboy/llm_security
利用分类法和敏感词检测法对生成式大模型的输入和输出内容进行安全检测，尽早识别风险内容。The input and output contents of generative large model are checked by classification method and sensitive word detection method to identify content risk as early as possible.
Language:Java10

llm-security

pathwaycom/llm-app

Giskard-AI/giskard

NVIDIA/garak

verazuo/jailbreak_llms

protectai/llm-guard

msoedov/agentic_security

mariocandela/beelzebub

EasyJailbreak/EasyJailbreak

chawins/llm-sp

deadbits/vigil-llm

R3DRUN3/sploitcraft

liu00222/Open-Prompt-Injection

phantasmlabs/phantasm

yevh/TaaC-AI

ZenGuard-AI/fast-llm-security-guardrails

arekusandr/last_layer

raga-ai-hub/raga-llm-hub

lakeraai/pint-benchmark

pdparchitect/llm-hacking-database

llm-platform-security/SecGPT

microsoft/BIPIA

NaniDAO/ie

azminewasi/Awesome-LLMs-ICLR-24

briland/LLM-security-and-privacy

RomiconEZ/llamator

sinanw/llm-security-prompt-injection

SEC-CAFE/handbook

leondz/lm_risk_cards

LostOxygen/llm-confidentiality

llm-platform-security/chatgpt-plugin-eval

TrustAI-laboratory/Learn-Prompt-Hacking

google/litmus

dapurv5/awesome-red-teaming-llms

lakeraai/chainguard

levitation-opensource/Manipulative-Expression-Recognition

jiangnanboy/llm_security