hcnet23's Stars
e9t/nsmc
Naver sentiment movie corpus
songys/AwesomeKorean_Data
한국어 데이터 세트 링크
ForestHouse2316/gksdudaovld
QWERTY 키보드 한영 자판 매핑 프로그램 / Mapping program for conversion between KO-EN on QWERTY keyboard
aurelio-labs/semantic-router
Superfast AI decision making and intelligent processing of multi-modal data.
tokern/piicatcher
Scan databases and data warehouses for PII data. Tag tables and columns in data catalogs like Amundsen and Datahub
thoughtworks-datakind/anonymizer
Library for identification, anonymization and de-anonymization of PII data
microsoft/presidio
Context aware, pluggable and customizable data protection and de-identification SDK for text and images
DataFog/codexify
An open-source API that identifies, masks, and replaces Personallly Identifying Information (PII)
awslabs/sensitive-data-protection-on-aws
The Sensitive Data Protection on AWS solution allows enterprise customers to create data catalogs, discover, protect, and visualize sensitive data across multiple AWS accounts. The solution eliminates the need for manual tagging to track sensitive data such as Personal Identifiable Information (PII) and classified information.
EdyVision/pii-codex
A research python package for detecting, categorizing, and assessing the severity of personal identifiable information (PII)
edwardcooper/piidetect
A package to build an end-to-end pipeline for detecting personally identifiable information from text.
edwardcooper/data-sentry
A project to build a machine learning pipeline to detect personal identifiable information (PII)
apicrafter/metacrafter-registry
Registry of metadata identifier entities like UUID, GUID, person fullname, address and so on. Linked with other sources
akazah/prompt-anonymizer
Anonymize / mask personal information before sending prompts to chat AI (like ChatGPT provided by OpenAI)
redhuntlabs/Octopii
An AI-powered Personal Identifiable Information (PII) scanner.
Zhe-Young/WICL
Code for EMNLP 2023 Findings paper: "Not All Demonstration Examples are Equally Beneficial: Reweighting Demonstration Examples for In-Context Learning"
songys/Toxic_comment_data
Naver sentiment movie corpus v1.0_감성분석 레이블링 상세화
korquad/korquad.github.io
Korean wiki QA dataset for MRC
github/CodeSearchNet
Datasets, tools, and benchmarks for representation learning of code.
TikhonJelvis/RL-book
langchain-ai/langchain
🦜🔗 Build context-aware reasoning applications
dair-ai/Prompt-Engineering-Guide
🐙 Guides, papers, lecture, notebooks and resources for prompt engineering
obi-ml-public/ehr_deidentification
Robust de-identification of medical notes using transformer architectures
uclanlp/awesome-fairness-papers
Papers on fairness in NLP
rominf/profanity-filter
A Python library for detecting and filtering profanity
ashishps1/awesome-leetcode-resources
Awesome LeetCode resources to learn Data Structures and Algorithms and prepare for Coding Interviews.
Sanjeev-Thiyagarajan/fastapi-course-practice
protectai/rebuff
LLM Prompt Injection Detector