pii-detection

There are 43 repositories under pii-detection topic.

  • microsoft/presidio

    Context aware, pluggable and customizable data protection and de-identification SDK for text and images

    Language:Python3.8k71416575
  • redhuntlabs/Octopii

    An AI-powered Personal Identifiable Information (PII) scanner.

    Language:Python643111054
  • google/magritte

    Mediapipe-based library to redact faces from videos and images

    Language:C++43914916
  • awslabs/sensitive-data-protection-on-aws

    The Sensitive Data Protection on AWS solution allows enterprise customers to create data catalogs, discover, protect, and visualize sensitive data across multiple AWS accounts. The solution eliminates the need for manual tagging to track sensitive data such as Personal Identifiable Information (PII) and classified information.

    Language:TypeScript11115210
  • databrickslabs/discoverx

    A Swiss-Army-knife for your Data Intelligence platform administration.

    Language:Python10752211
  • EdyVision/pii-codex

    A research python package for detecting, categorizing, and assessing the severity of personal identifiable information (PII)

    Language:Python733610
  • apicrafter/metacrafter

    Metadata and data identification tool and Python library. Identifies PII, common identifiers, language specific identifiers. Fully customizable and flexible rules

    Language:Python443275
  • edwardcooper/piidetect

    A package to build an end-to-end pipeline for detecting personally identifiable information from text.

    Language:Python43239
  • akazah/prompt-anonymizer

    Anonymize / mask personal information before sending prompts to chat AI (like ChatGPT provided by OpenAI)

    Language:Python20101
  • Akshay7591/Web-Scanner

    Web Scanner written in Python which after scanning the given URL returns it's domain name, ip address, nmap scan results and also the contents the URL's robots.txt.

    Language:Python20113
  • metadata-guardian

    fvaleye/metadata-guardian

    Provide an easy way with Python to protect your data sources by searching its metadata. 🛡️

    Language:Python17311
  • apicrafter/metacrafter-registry

    Registry of metadata identifier entities like UUID, GUID, person fullname, address and so on. Linked with other sources

    Language:Python162520
  • edwardcooper/data-sentry

    A project to build a machine learning pipeline to detect personal identifiable information (PII)

    Language:Jupyter Notebook16209
  • dotfurther/OpenDiscoverSDK

    .NET 8 API for document file format identification, text/metadata/attachment/embedded object/sensitive item (PII/PHI)/entity extraction.

    Language:C#15110
  • DataFog/datafog-python

    Open source PII detection and anonymization tool: easy-to-use, configurable, and extensible

    Language:Python11263
  • dotfurther/OpenDiscoverPlatformCaseStudy

    Case study using dotfurther's Open Discover Platform with the RavenDB document store to rapidly create a full-text search/eDiscovery/information governance capable demonstration application.

  • arcjet/example-nextjs

    An example Next.js application protected by Arcjet.

    Language:TypeScript102
  • HabaneroCake/pii-filter

    A personally identifiable information (PII) filter.

    Language:TypeScript10121
  • mddunlap924/PII-Detection

    Personal Identifiable Information (PII) entity detection and performance enhancement with synthetic data generation

    Language:Python10102
  • DataFog/codexify

    An open-source API that identifies, masks, and replaces Personallly Identifying Information (PII)

    Language:Python9201
  • gretelai/multi-table

    Notebook and code to synthesize relational databases such as Postgres and Mysql.

    Language:Jupyter Notebook82111
  • oxytis/oxidize

    Discover PII sensitive data. Find most common personally identifiable information in your environment such as financial related information. Quickly determine exposure after a breach.

    Language:Go7102
  • Lizhecheng02/Kaggle-PII_Data_Detection

    Implement named entity recognition (NER) using regex and fine-tuned LLM, with a total of 15 categories. The ultimate goal is to apply the model to detect personally identifiable information (PII) in student writing.

    Language:Jupyter Notebook5101
  • mns-llc/bitsnarf

    Finds useful information in English/US strings using regex with a focus on PII.

    Language:Python5103
  • bballamudi/data-sentry

    A project to build a machine learning pipeline to detect personal identifiable information (PII)

    Language:Jupyter Notebook4100
  • aws-samples/aws-appconfig-pii-extn

    Sample AWS AppConfig Extension integrating with Amazon Comprehend for PII detection

    Language:Python130
  • BhavyaMPatel/SecureScanner

    This is a PII Masker application where user can mask their pdf and make use of it

    Language:JavaScript1100
  • caesarw0/sanityze

    Spot & Redact PII from Pandas data frames

    Language:Python100
  • EmediongFrancis/alx-backend-user-data

    Repository of projects involving user data.

    Language:Python11017
  • hperer02/PII-data-detection

    This project was developed for a Kaggle competition focused on detecting Personally Identifiable Information (PII) in student writing. The primary objective was to build a robust model capable of identifying PII with high recall. The DeBERTa v3 transformer model was chosen for this task after comparing its performance with other transformer models.

    Language:Jupyter Notebook1100
  • michael-ortiz/terraform-aws-s3-audio-pii-guardian

    🕵️‍♂️ Personally Identifiable Information (PII) Detection and Redaction for Voice Audio Files Stored in S3 and AWS Transcribe

    Language:TypeScript1
  • arcjet/example-nestjs

    An example NestJS application protected by Arcjet.

    Language:TypeScript00
  • arcjet/example-remix

    An example Remix application protected by Arcjet.

    Language:TypeScript00
  • ausdfrost/anonymizePy

    🌱 anonymizePy helps you anonymize your data with ease

    Language:Python0100
  • caesarw0/sanityzeR

    Spot & Redact PII from R data frames/Tibbles

    Language:R00
  • david-acker/redact-pii

    Redact PII from images with Azure, OpenAI, and SkiaSharp

    Language:C#10