pii-detection
There are 40 repositories under the pii-detection topic.
microsoft/presidio
Context-aware, pluggable, and customizable data protection and de-identification SDK for text and images
redhuntlabs/Octopii
An AI-powered Personal Identifiable Information (PII) scanner.
google/magritte
Mediapipe-based library to redact faces from videos and images
awslabs/sensitive-data-protection-on-aws
The Sensitive Data Protection on AWS solution allows enterprise customers to create data catalogs, discover, protect, and visualize sensitive data across multiple AWS accounts. The solution eliminates the need for manual tagging to track sensitive data such as Personal Identifiable Information (PII) and classified information.
databrickslabs/discoverx
A Swiss-Army-knife for your Data Intelligence platform administration.
EdyVision/pii-codex
A research Python package for detecting, categorizing, and assessing the severity of personal identifiable information (PII)
apicrafter/metacrafter
Metadata and data identification tool and Python library. Identifies PII, common identifiers, and language-specific identifiers. Fully customizable and flexible rules.
edwardcooper/piidetect
A package to build an end-to-end pipeline for detecting personally identifiable information from text.
akazah/prompt-anonymizer
Anonymize / mask personal information before sending prompts to a chat AI (such as OpenAI's ChatGPT)
Akshay7591/Web-Scanner
Web scanner written in Python that scans a given URL and returns its domain name, IP address, nmap scan results, and the contents of the URL's robots.txt.
fvaleye/metadata-guardian
Provides an easy way to protect your data sources with Python by searching their metadata. 🛡️
apicrafter/metacrafter-registry
Registry of metadata identifier entities such as UUID, GUID, person full name, address, and so on. Linked with other sources.
edwardcooper/data-sentry
A project to build a machine learning pipeline to detect personal identifiable information (PII)
dotfurther/OpenDiscoverSDK
.NET 6 API for document file format identification, text/metadata/attachment/embedded object/sensitive item (PII/PHI)/entity extraction.
dotfurther/OpenDiscoverPlatformCaseStudy
Case study using dotfurther's Open Discover Platform with the RavenDB document store to rapidly create a full-text search/eDiscovery/information governance capable demonstration application.
HabaneroCake/pii-filter
A personally identifiable information (PII) filter.
DataFog/codexify
An open-source API that identifies, masks, and replaces Personally Identifying Information (PII)
DataFog/datafog-python
Open source PII detection and anonymization tool: easy-to-use, configurable, and extensible
gretelai/multi-table
Notebook and code to synthesize relational databases such as Postgres and MySQL.
mddunlap924/PII-Detection
Personal Identifiable Information (PII) entity detection and performance enhancement with synthetic data generation
oxytis/oxidize
Discover sensitive PII data. Find the most common personally identifiable information in your environment, such as financial information. Quickly determine exposure after a breach.
Lizhecheng02/Kaggle-PII_Data_Detection
Implement named entity recognition (NER) using regex and fine-tuned LLM, with a total of 15 categories. The ultimate goal is to apply the model to detect personally identifiable information (PII) in student writing.
mns-llc/bitsnarf
Finds useful information in English/US strings using regex with a focus on PII.
bballamudi/data-sentry
A project to build a machine learning pipeline to detect personal identifiable information (PII)
aws-samples/aws-appconfig-pii-extn
Sample AWS AppConfig Extension integrating with Amazon Comprehend for PII detection
BhavyaMPatel/SecureScanner
A PII masker application that lets users mask PII in their PDFs before using them
caesarw0/sanityze
Spot & Redact PII from Pandas data frames
chchench/pii-detect
Objective-C sample code for detecting PII such as SSN and credit card numbers
EmediongFrancis/alx-backend-user-data
Repository of projects involving user data.
hperer02/PII-data-detection
This project was developed for a Kaggle competition focused on detecting Personally Identifiable Information (PII) in student writing. The primary objective was to build a robust model capable of identifying PII with high recall. The DeBERTa v3 transformer model was chosen for this task after comparing its performance with other transformer models.
ausdfrost/anonymizePy
🌱 anonymizePy helps you anonymize your data with ease
CogNetSys/Sonarum
Sonarum revolutionizes human-machine communication by securing real-time text, audio, and video streams while remaining fast, secure, and lightweight. It detects and controls sensitive and secure data on-the-fly, ensuring privacy and security without compromising quality.
stevelange17/AzureDevOpsPIIScan.CLI
Bare bones code meant for sample/education only.
caesarw0/sanityzeR
Spot & Redact PII from R data frames/Tibbles
david-acker/redact-pii
Redact PII from images with Azure, OpenAI, and SkiaSharp
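
Several of the projects listed above describe regex-based detection of identifiers such as SSNs and credit card numbers. As a rough illustration of that general idea (not the implementation of any listed repository), a minimal Python sketch might look like the following. The pattern names, the labels, and the `find_pii` helper are hypothetical, and the patterns assume US-style formatting.

```python
import re

# Hypothetical, deliberately simple patterns; real scanners use far more rules.
SSN_RE = re.compile(r"\b\d{3}-\d{2}-\d{4}\b")
# 13 to 19 digits, optionally separated by single spaces or hyphens.
CARD_RE = re.compile(r"\b\d(?:[ -]?\d){12,18}\b")


def luhn_ok(digits: str) -> bool:
    """Return True if a string of digits passes the Luhn checksum."""
    total = 0
    for i, ch in enumerate(reversed(digits)):
        d = int(ch)
        if i % 2 == 1:  # double every second digit, counting from the right
            d *= 2
            if d > 9:
                d -= 9
        total += d
    return total % 10 == 0


def find_pii(text: str) -> list[tuple[str, str]]:
    """Return (label, matched_text) pairs for SSN-like and card-like strings."""
    hits = [("SSN", m.group()) for m in SSN_RE.finditer(text)]
    for m in CARD_RE.finditer(text):
        digits = re.sub(r"\D", "", m.group())
        # The Luhn check filters out many digit runs that are not card numbers.
        if luhn_ok(digits):
            hits.append(("CREDIT_CARD", m.group()))
    return hits


sample = "SSN 123-45-6789, card 4111 1111 1111 1111, other 1234 5678 9012 3456."
print(find_pii(sample))
```

Pattern matching of this kind is cheap but prone to false positives and misses, which is presumably why several of the listed projects combine it with NER models or context-aware scoring.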