AnthenaMatrix

Securing the Future of AI - We're on a mission to ensure the safety and integrity of AI systems. Bringing security to the forefront of AI development.

Anthena MatrixCloud

Pinned Repositories

AI-Audio-Data-Poisoning
AI Audio Data Poisoning is a Python script that demonstrates how to add adversarial noise to audio data. This technique, known as audio data poisoning, involves injecting imperceptible noise into audio files to manipulate the behavior of AI systems trained on this data.
Language:Python10 1 11
AI-Image-Data-Poisoning
AI Image Data Poisoning is a Python script that demonstrates how to add imperceptible perturbations to images, known as adversarial noise, which can disrupt the training process of AI models.
Language:Python17 1 01
AI-Prompt-Injection-List
AI/LLM Prompt Injection List is a curated collection of prompts designed for testing AI or Large Language Models (LLMs) for prompt injection vulnerabilities. This list aims to provide a comprehensive set of prompts that can be used to evaluate the behavior of AI or LLM systems when exposed to different types of inputs.
23 1 02
AI-Vulnerability-Assessment-Framework
The AI Vulnerability Assessment Framework is an open-source checklist designed to guide users through the process of assessing the vulnerability of artificial intelligence (AI) systems to various types of attacks and security threats
8 2 02
ASCII-Art-Prompt-Injection
ASCII Art Prompt Injection is a novel approach to hacking AI assistants using ASCII art. This project leverages the distracting nature of ASCII art to bypass security measures and inject prompts into large language models, such as GPT-4, leading them to provide unintended or harmful responses.
15 1 06
Image-Prompt-Injection
Image Prompt Injection is a Python script that demonstrates how to embed a secret prompt within an image using steganography techniques. This hidden prompt can be later extracted by an AI system for analysis, enabling covert communication with AI models through images.
Language:Python24 1 113
Many-Shot-Jailbreaking
Research on "Many-Shot Jailbreaking" in Large Language Models (LLMs). It unveils a novel technique capable of bypassing the safety mechanisms of LLMs, including those developed by Anthropic and other leading AI organizations.
18 2 00
Prompt-Injection-Testing-Tool
The Prompt Injection Testing Tool is a Python script designed to assess the security of your AI system's prompt handling against a predefined list of user prompts commonly used for injection attacks. This tool utilizes the OpenAI GPT-3.5 model to generate responses to system-user prompt pairs and outputs the results to a CSV file for analysis.
Language:Python28 1 04
The-I-Exemption-Bypassing-LLM-Ethical-Filters
The "I" Exemption, is a curious behavior in some LLMs. We discover how these AI systems might shy away from directly assisting with unethical actions if you ask in the first person ("I"). But with a clever rephrase to a general scenario ("they"), they might spill the beans and explain the unethical method.
10 2 00
Website-Prompt-Injection
Website Prompt Injection is a concept that allows for the injection of prompts into an AI system via a website's. This technique exploits the interaction between users, websites, and AI systems to execute specific prompts that influence AI behavior.
Language:HTML32 1 110

AnthenaMatrix's Repositories

AnthenaMatrix/Website-Prompt-Injection
Website Prompt Injection is a concept that allows for the injection of prompts into an AI system via a website's. This technique exploits the interaction between users, websites, and AI systems to execute specific prompts that influence AI behavior.
Language:HTML32 1 110
AnthenaMatrix/Prompt-Injection-Testing-Tool
The Prompt Injection Testing Tool is a Python script designed to assess the security of your AI system's prompt handling against a predefined list of user prompts commonly used for injection attacks. This tool utilizes the OpenAI GPT-3.5 model to generate responses to system-user prompt pairs and outputs the results to a CSV file for analysis.
Language:Python28 1 04
AnthenaMatrix/Image-Prompt-Injection
Image Prompt Injection is a Python script that demonstrates how to embed a secret prompt within an image using steganography techniques. This hidden prompt can be later extracted by an AI system for analysis, enabling covert communication with AI models through images.
Language:Python24 1 113
AnthenaMatrix/AI-Prompt-Injection-List
AI/LLM Prompt Injection List is a curated collection of prompts designed for testing AI or Large Language Models (LLMs) for prompt injection vulnerabilities. This list aims to provide a comprehensive set of prompts that can be used to evaluate the behavior of AI or LLM systems when exposed to different types of inputs.
23 1 02
AnthenaMatrix/Many-Shot-Jailbreaking
Research on "Many-Shot Jailbreaking" in Large Language Models (LLMs). It unveils a novel technique capable of bypassing the safety mechanisms of LLMs, including those developed by Anthropic and other leading AI organizations.
18 2 00
AnthenaMatrix/AI-Image-Data-Poisoning
AI Image Data Poisoning is a Python script that demonstrates how to add imperceptible perturbations to images, known as adversarial noise, which can disrupt the training process of AI models.
Language:Python17 1 01
AnthenaMatrix/ASCII-Art-Prompt-Injection
ASCII Art Prompt Injection is a novel approach to hacking AI assistants using ASCII art. This project leverages the distracting nature of ASCII art to bypass security measures and inject prompts into large language models, such as GPT-4, leading them to provide unintended or harmful responses.
15 1 06
AnthenaMatrix/AI-Audio-Data-Poisoning
AI Audio Data Poisoning is a Python script that demonstrates how to add adversarial noise to audio data. This technique, known as audio data poisoning, involves injecting imperceptible noise into audio files to manipulate the behavior of AI systems trained on this data.
Language:Python10 1 11
AnthenaMatrix/The-I-Exemption-Bypassing-LLM-Ethical-Filters
The "I" Exemption, is a curious behavior in some LLMs. We discover how these AI systems might shy away from directly assisting with unethical actions if you ask in the first person ("I"). But with a clever rephrase to a general scenario ("they"), they might spill the beans and explain the unethical method.
10 2 00
AnthenaMatrix/AI-Vulnerability-Assessment-Framework
The AI Vulnerability Assessment Framework is an open-source checklist designed to guide users through the process of assessing the vulnerability of artificial intelligence (AI) systems to various types of attacks and security threats
8 2 02
AnthenaMatrix/AnthenaMatrix
Config files for my GitHub profile.
7 2 02

AnthenaMatrix

Pinned Repositories

AI-Audio-Data-Poisoning

AI-Image-Data-Poisoning

AI-Prompt-Injection-List

AI-Vulnerability-Assessment-Framework

ASCII-Art-Prompt-Injection

Image-Prompt-Injection

Many-Shot-Jailbreaking

Prompt-Injection-Testing-Tool

The-I-Exemption-Bypassing-LLM-Ethical-Filters

Website-Prompt-Injection

AnthenaMatrix's Repositories

AnthenaMatrix/Website-Prompt-Injection

AnthenaMatrix/Prompt-Injection-Testing-Tool

AnthenaMatrix/Image-Prompt-Injection

AnthenaMatrix/AI-Prompt-Injection-List

AnthenaMatrix/Many-Shot-Jailbreaking

AnthenaMatrix/AI-Image-Data-Poisoning

AnthenaMatrix/ASCII-Art-Prompt-Injection

AnthenaMatrix/AI-Audio-Data-Poisoning

AnthenaMatrix/The-I-Exemption-Bypassing-LLM-Ethical-Filters

AnthenaMatrix/AI-Vulnerability-Assessment-Framework

AnthenaMatrix/AnthenaMatrix