prompt-injection

There are 95 repositories under prompt-injection topic.

CyberAlbSecOP/Awesome_GPT_Super_Prompting
ChatGPT Jailbreaks, GPT Assistants Prompt Leaks, GPTs Prompt Injection, LLM Prompt Security, Super Prompts, Prompt Hack, Prompt Security, Ai Prompt Engineering, Adversarial Machine Learning.
Language:HTML3k 41 2386
protectai/llm-guard
The Security Toolkit for LLM Interactions
Language:Python2.1k 23 96276
abilzerian/LLM-Prompt-Library
A playground of highly experimental prompts, Jinja2 templates & scripts for machine intelligence models from OpenAI, Anthropic, DeepSeek, Meta, Mistral, Google, xAI & others. Alex Bilzerian (2022-2025).
Language:Jinja1.5k 31 1150
protectai/rebuff
LLM Prompt Injection Detector
Language:TypeScript1.3k 15 57114
utkusen/promptmap
a security scanner for custom LLM applications
Language:Python961 13 1101
whylabs/langkit
🔍 LangKit: An open-source toolkit for monitoring Large Language Models (LLMs). 📚 Extracts signals from prompts & responses, ensuring safety & security. 🛡️ Features include text quality, relevance metrics, & sentiment analysis. 📊 A comprehensive tool for LLM observability. 👀
Language:Jupyter Notebook946 16 6169
yunanwg/brilliant-CV
💼 another CV template for your job application, yet powered by Typst and more
Language:Typst611 5 6755
zacfrulloni/Prompt-Engineering-Holy-Grail
Land your first client with vibe coding: skool.com/lovable-vibe-coding/about
Language:HTML550 10 264
tldrsec/prompt-injection-defenses
Every practical and proposed defense against prompt injection.
546 8 935
deadbits/vigil-llm
⚡ Vigil ⚡ Detect prompt injections, jailbreaks, and other potentially risky Large Language Model (LLM) inputs
Language:Python415 11 5248
automorphic-ai/aegis
Self-hardening firewall for large language models
Language:Python264 3 06
langgptai/Awesome-Multimodal-Prompts
Prompts of GPT-4V & DALL-E3 to full utilize the multi-modal ability. GPT4V Prompts, DALL-E3 Prompts.
262 2 022
yunwei37/prompt-hacker-collections
prompt attack-defense, prompt Injection, reverse engineering notes and examples | 提示词对抗、破解例子与笔记
227 6 024
dropbox/llm-security
Dropbox LLM Security research code and results
Language:Python221 7 028
shell-nlp/gpt_server
gpt_server是一个用于生产级部署LLMs、Embedding、Reranker、ASR、TTS、文生图、图片编辑和文生视频的开源框架。
Language:Python209 4 2715
liu00222/Open-Prompt-Injection
This repository provides implementation to formalize and benchmark Prompt Injection attacks and defenses
Language:Python181 2 628
lakeraai/pint-benchmark
A benchmark for prompt injection detection systems.
Language:Jupyter Notebook133 5 313
TrustAI-laboratory/Learn-Prompt-Hacking
This is The most comprehensive prompt hacking course available, which record our progress on a prompt engineering and prompt hacking course.
Language:Jupyter Notebook104 1 011
kereva-dev/kereva-scanner
Code scanner to check for issues in prompts and LLM calls
Language:Python733
NullTrace-Security/Exploiting-AI
This class is a broad overview and dive into Exploiting AI and the different attacks that exist, and best practice strategies.
Language:Python71
pasquini-dario/project_mantis
Project Mantis: Hacking Back the AI-Hacker; Prompt Injection as a Defense Against LLM-driven Cyberattacks
Language:Python66 4 07
HumanCompatibleAI/tensor-trust
A prompt injection game to collect data for robust ML research
Language:Python63 6 1645
wearetyomsmnv/Awesome-LLMSecOps
LLM | Security | Operations in one github repo with good links and pictures.
Language:HTML54 3 08
gdalmau/lakera-gandalf-solutions
My inputs for the LLM Gandalf made by Lakera
47 3 25
ZapDos7/lakera-gandalf
My solutions for Lakera's Gandalf
43 1 15
GPTSafe/PromptGuard
Build production ready apps for GPT using Node.js & TypeScript
Language:TypeScript42 2 141
sinanw/llm-security-prompt-injection
This project investigates the security of large language models by performing binary classification of a set of input prompts to discover malicious prompts. Several approaches have been analyzed using classical ML algorithms, a trained LLM model, and a fine-tuned LLM model.
Language:Jupyter Notebook39 3 28
jailbreakme-xyz/jailbreak
jailbreakme.xyz is an open-source decentralized app (dApp) where users are challenged to try and jailbreak pre-existing LLMs in order to find weaknesses and be rewarded. 🏆
Language:JavaScript35 2 019
LostOxygen/llm-confidentiality
Whispers in the Machine: Confidentiality in LLM-integrated Systems
Language:Python35 1 14
CX330Blake/Spell-Whisperer
Language:TypeScript311
grepstrength/WideOpenAI
Short list of indirect prompt injection attacks for OpenAI-based models.
31 2 02
MaxMLang/pytector
Easy to use LLM Prompt Injection Detection / Detector Python Package
Language:Python30 1 021
microsoft/gandalf_vs_gandalf
Turning Gandalf against itself. Use LLMs to automate playing Lakera Gandalf challenge without needing to set up an account with a platform provider.
Language:Jupyter Notebook28 5 11
peluche/deck-of-many-prompts
Manual Prompt Injection / Red Teaming Tool
Language:Python27 1 01
SemanticBrainCorp/SemanticShield
The Security Toolkit for managing Generative AI(especially LLMs) and Supervised Learning processes(Learning and Inference).
Language:Python21 2 03
lakeraai/chainguard
Guard your LangChain applications against prompt injection with Lakera ChainGuard.
Language:Python20 4 13

prompt-injection

CyberAlbSecOP/Awesome_GPT_Super_Prompting

protectai/llm-guard

abilzerian/LLM-Prompt-Library

protectai/rebuff

utkusen/promptmap

whylabs/langkit

yunanwg/brilliant-CV

zacfrulloni/Prompt-Engineering-Holy-Grail

tldrsec/prompt-injection-defenses

deadbits/vigil-llm

automorphic-ai/aegis

langgptai/Awesome-Multimodal-Prompts

yunwei37/prompt-hacker-collections

dropbox/llm-security

shell-nlp/gpt_server

liu00222/Open-Prompt-Injection

lakeraai/pint-benchmark

TrustAI-laboratory/Learn-Prompt-Hacking

kereva-dev/kereva-scanner

NullTrace-Security/Exploiting-AI

pasquini-dario/project_mantis

HumanCompatibleAI/tensor-trust

wearetyomsmnv/Awesome-LLMSecOps

gdalmau/lakera-gandalf-solutions

ZapDos7/lakera-gandalf

GPTSafe/PromptGuard

sinanw/llm-security-prompt-injection

jailbreakme-xyz/jailbreak

LostOxygen/llm-confidentiality

CX330Blake/Spell-Whisperer

grepstrength/WideOpenAI

MaxMLang/pytector

microsoft/gandalf_vs_gandalf

peluche/deck-of-many-prompts

SemanticBrainCorp/SemanticShield

lakeraai/chainguard