pii-detection
There are 43 repositories under pii-detection topic.
microsoft/presidio
Context aware, pluggable and customizable data protection and de-identification SDK for text and images
redhuntlabs/Octopii
An AI-powered Personal Identifiable Information (PII) scanner.
google/magritte
Mediapipe-based library to redact faces from videos and images
awslabs/sensitive-data-protection-on-aws
The Sensitive Data Protection on AWS solution allows enterprise customers to create data catalogs, discover, protect, and visualize sensitive data across multiple AWS accounts. The solution eliminates the need for manual tagging to track sensitive data such as Personal Identifiable Information (PII) and classified information.
databrickslabs/discoverx
A Swiss-Army-knife for your Data Intelligence platform administration.
EdyVision/pii-codex
A research python package for detecting, categorizing, and assessing the severity of personal identifiable information (PII)
apicrafter/metacrafter
Metadata and data identification tool and Python library. Identifies PII, common identifiers, language specific identifiers. Fully customizable and flexible rules
edwardcooper/piidetect
A package to build an end-to-end pipeline for detecting personally identifiable information from text.
akazah/prompt-anonymizer
Anonymize / mask personal information before sending prompts to chat AI (like ChatGPT provided by OpenAI)
Akshay7591/Web-Scanner
Web Scanner written in Python which after scanning the given URL returns it's domain name, ip address, nmap scan results and also the contents the URL's robots.txt.
fvaleye/metadata-guardian
Provide an easy way with Python to protect your data sources by searching its metadata. 🛡️
apicrafter/metacrafter-registry
Registry of metadata identifier entities like UUID, GUID, person fullname, address and so on. Linked with other sources
edwardcooper/data-sentry
A project to build a machine learning pipeline to detect personal identifiable information (PII)
dotfurther/OpenDiscoverSDK
.NET 8 API for document file format identification, text/metadata/attachment/embedded object/sensitive item (PII/PHI)/entity extraction.
DataFog/datafog-python
Open source PII detection and anonymization tool: easy-to-use, configurable, and extensible
dotfurther/OpenDiscoverPlatformCaseStudy
Case study using dotfurther's Open Discover Platform with the RavenDB document store to rapidly create a full-text search/eDiscovery/information governance capable demonstration application.
arcjet/example-nextjs
An example Next.js application protected by Arcjet.
HabaneroCake/pii-filter
A personally identifiable information (PII) filter.
mddunlap924/PII-Detection
Personal Identifiable Information (PII) entity detection and performance enhancement with synthetic data generation
DataFog/codexify
An open-source API that identifies, masks, and replaces Personallly Identifying Information (PII)
gretelai/multi-table
Notebook and code to synthesize relational databases such as Postgres and Mysql.
oxytis/oxidize
Discover PII sensitive data. Find most common personally identifiable information in your environment such as financial related information. Quickly determine exposure after a breach.
Lizhecheng02/Kaggle-PII_Data_Detection
Implement named entity recognition (NER) using regex and fine-tuned LLM, with a total of 15 categories. The ultimate goal is to apply the model to detect personally identifiable information (PII) in student writing.
mns-llc/bitsnarf
Finds useful information in English/US strings using regex with a focus on PII.
bballamudi/data-sentry
A project to build a machine learning pipeline to detect personal identifiable information (PII)
aws-samples/aws-appconfig-pii-extn
Sample AWS AppConfig Extension integrating with Amazon Comprehend for PII detection
BhavyaMPatel/SecureScanner
This is a PII Masker application where user can mask their pdf and make use of it
caesarw0/sanityze
Spot & Redact PII from Pandas data frames
EmediongFrancis/alx-backend-user-data
Repository of projects involving user data.
hperer02/PII-data-detection
This project was developed for a Kaggle competition focused on detecting Personally Identifiable Information (PII) in student writing. The primary objective was to build a robust model capable of identifying PII with high recall. The DeBERTa v3 transformer model was chosen for this task after comparing its performance with other transformer models.
michael-ortiz/terraform-aws-s3-audio-pii-guardian
🕵️♂️ Personally Identifiable Information (PII) Detection and Redaction for Voice Audio Files Stored in S3 and AWS Transcribe
arcjet/example-nestjs
An example NestJS application protected by Arcjet.
arcjet/example-remix
An example Remix application protected by Arcjet.
ausdfrost/anonymizePy
🌱 anonymizePy helps you anonymize your data with ease
caesarw0/sanityzeR
Spot & Redact PII from R data frames/Tibbles
david-acker/redact-pii
Redact PII from images with Azure, OpenAI, and SkiaSharp