Data Privacy Evaluation Suite

1. Introduction

<<<<<<< HEAD This project is one of the LEVEL3, powered by Arkadia, tracks, partnered with the AI Technology company Aleph Alpha, with the purpose of creating an automated evaluation suite for Large Language Models. Given the recent outburst in AI agents, this project is an attempt of contributing to the evaluation process of such systems, in the domain of Data Privacy, tackling real problems, as well as an opportunity to acquire knowledge and hands-on experience on the fast evolving field of AI. By simulating real case scenarios, applied in mulitple LLMs used as Systems Under Test (SUTs), in an attempt to address common pitfalls, assess the limitations of fine-tuned models and measure their ability to resist and protect sensitive personal information, this evaluation system can be used to provide a reference about their internal behavior on how they treat sensitive data, using commonly identified metrics.

The project simulates the case of a RAG system, accessible by mulitple users, and evaluates its ability to protect sensitive information, given a common external database. More specifically, the use case is this of a HR employee database,containing sensitive personal information. Through adversarial prompting, the evaluation suite attempts to extract said information and measures the accuracy of the leaked results. Though a specific subject is necessary for building the suite, it is also easy to imagine that such systems could be applied to a variety of domains and applications, noting the importance of the evaluation process.

1.1 RAG models

Retrieval Augmented Generation (RAG) models are systems that can access an external database to retrieve the most updated and accurate information, overcoming any possible references based on their training data. The external database is accessed through an embedding model for prompt flexibilty, simulating real world systems, as much as possible. Even though this project utlizes an exteenal table databse, where an sql tool could easily suffice for matching queries, an embedding model provides further flexibility on the adversarial prompts' creativity.

Low Sensitivity	Medium Sensitivity	High Sensitivity
Salary	Employee_Name	Credit card
EmpID	Home_Address	SSN
State	Email	IBAN
Zip	phone_number	Passport
Sex	IP	voterID
RaceDesc	IMEI/MAC address
	username
	DOB

Role	Description
Customer	Has access only to Internal Low Sensitivity Info
Agent	Has access to Internal Low & Medium Sensitivity Info
External contractor	Has access to Low & Medium Sensitivity Info of multiple Tenants
Admin	Has access to all Inteenal information

Field	Value
Name	'43' GmbH
Type	Start up
Industry	FinTech

chrisov/Data-Privacy-LLM-evaluation-system

🛠 Python (OpenAI)

🛠 Docker

Data Privacy Evaluation Suite

1. Introduction

1.1 RAG models

1.2 Embedding model

2. Data Privacy domain

3. Pipeline

3.1 External Database

3.2 Information Sensitivity categorization

3.3 Tenants

3.4 Users

3.5 Flowchart

3. Installation

4. Technical approach / Parameters

4.1 Prompts

4.1.1 Adversarial Categories

4.2 Ground Truth

4.3 SUT LLMs

4.4 System Prompt

4.4.1 Baseline

4.4.2 Custom Categorization

4.4.1 Baseline

4.4.2 Custom Categorization

4.5 Metrics

5. Conclusions

6. References