pii
There are 205 repositories under pii topic.
microsoft/presidio
An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.
CatchTheTornado/text-extract-api
Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents. Remove PII. Convert any document or picture to structured JSON or Markdown
capitalone/DataProfiler
What's in your data? Extract schema, statistics and entities from datasets
securitybunker/databunker
Secure Vault for Customer PII/PHI/PCI/KYC Records
redhuntlabs/Octopii
An AI-powered Personal Identifiable Information (PII) scanner.
rohitcoder/hawk-eye
A powerful scanner to scan your Filesystem, S3, MySQL, Redis, Google Cloud Storage and Firebase storage for PII and sensitive data.
tokern/piicatcher
Scan databases and data warehouses for PII data. Tag tables and columns in data catalogs like Amundsen and Datahub
thoughtbot/top_secret
Filter sensitive information from free text before sending it to external services or APIs, such as chatbots and LLMs.
microsoft/presidio-research
This package features data-science related tasks for developing new recognizers for Presidio. It is used for the evaluation of the entire system, as well as for evaluating specific PII recognizers or PII detection models.
samber/slog-formatter
🚨 slog: Attribute formatting
GoogleCloudPlatform/dlp-dataflow-deidentification
Multi Cloud Data Tokenization Solution By Using Dataflow and Cloud DLP
EdyVision/pii-codex
A research python package for detecting, categorizing, and assessing the severity of personal identifiable information (PII)
klouddb/klouddbshield
KloudDB Shield is a comprehensive Postgres Security Tool - PII Scanner , CIS Benchmarks , SSL audit , 12+ features .. Supports Postgres, RDS ,Aurora, MySQL
philterd/phileas
The open source PII and PHI redaction and de-identification engine
amanvirparhar/elara
A simple tool to anonymize LLM prompts.
rpgeeganage/pII-guard
🛡️ PII Guard is an LLM-powered tool that detects and manages Personally Identifiable Information (PII) in logs — designed to support data privacy and GDPR compliance
cxumol/promptmask
Never give AI companies your secrets! A local LLM-based privacy filter for LLM users. Seamless integration with your existing AI tools as a Python library / OpenAI SDK replacement / API Gatetway / Web Server.
polentino/redacted
Scala library and compiler plugin that prevent inadvertent leakage of sensitive fields in `case classes` (such as credentials, personal data, and other confidential information)
open-privacy/opv
Open Privacy Vault - Secure, Performant, Open Source PII as a Service.
deliciousinsights/mongoose-pii
A Mongoose plugin that lets you transparently cipher stored PII and use securely-hashed passwords
edwardcooper/piidetect
A package to build an end-to-end pipeline for detecting personally identifiable information from text.
PovertyAction/PII_detection
Application and python script to identify, remove, and/or recode personally identifiable information (PII) from field experiment datasets.
apicrafter/metacrafter
Metadata and data identification tool and Python library. Identifies PII, common identifiers, language specific identifiers. Fully customizable and flexible rules
jftuga/deidentification
Deidentify people's names and gender specific pronouns
AgenticA5/A5-PII-Anonymizer
Desktop App with Built-In LLM for Removing Personal Identifiable Information in Documents
mddunlap924/PII-Detection
Personal Identifiable Information (PII) entity detection and performance enhancement with synthetic data generation
Poogles/piiregex
Search for PII in Python
primait/veil
Rust derive macro for redacting sensitive data in std::fmt::Debug
AidanSpeakss/streamer-mode-for-firefox
Hides personal information from pages, similar to Discord's Streamer mode.
MLukman/Keycloak-PII-Data-Encryption-Provider
A Keycloak provider that enables encryption of user attributes that contain PII data to be automatically encrypted upon storing to database and then decrypted upon loading from database
aliengiraffe/deidentify
Simple yet powerful tool for identifying and anonymizing personal information in various formats.
nightfallai/nightfall-python-sdk
Python Data Loss Prevention (DLP) SDK - Nightfall Developer Platform
seanpedrick-case/doc_redaction
Redact PDF/image-based documents, or CSV/XLSX files using a Gradio-based GUI interface
kylemclaren/scrub
A Python package to scrub PII
Stuub/GitHush
Detecting leaked secrets, API keys, credentials, and sensitive files from public repositories in near real-time using the GitHub Events API
ipcrypt-std/ipcrypt2
A tiny, portable implementation of the IPCrypt specification in C.