data-validation
There are 504 repositories under the data-validation topic.
rjsf-team/react-jsonschema-form
A React component for building Web forms from JSON Schema.
cleanlab/cleanlab
Cleanlab's open-source library is the standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
open-metadata/OpenMetadata
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.
evidentlyai/evidently
Evidently is an open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. From tabular data to Gen AI. 100+ metrics.
unionai-oss/pandera
A lightweight, flexible, and expressive statistical data testing library
deepchecks/deepchecks
Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML validation needs, enabling you to thoroughly test your data and models from research to production.
pyeve/cerberus
Lightweight, extensible data validation library for Python
sodadata/soda-core
⚡ Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io
dry-rb/dry-validation
Validation library with type-safe schemas and rules
cleanlab/cleanvision
Automatically find issues in image datasets and practice data-centric computer vision.
rstudio/pointblank
Data quality assessment and metadata reporting for data frames and database tables
biscolab/laravel-recaptcha
Google ReCaptcha package for Laravel
MigoXLab/dingo
Dingo: A Comprehensive AI Data Quality Evaluation Tool
dry-rb/dry-schema
Coercion and validation for data structures
encord-team/encord-active
The toolkit to test, validate, and evaluate your models and surface, curate, and prioritize the most valuable data for labeling.
DataRecce/recce
The data-validation toolkit for enhanced dbt (data build tool) PR review
EXXETA/openapi-cop
A proxy that validates responses and requests against an OpenAPI document. https://www.npmjs.com/package/openapi-cop https://hub.docker.com/r/lxlu/openapi-cop
lucono/xtypejs
Elegant, highly efficient data validation for JavaScript.
kjam/data-cleaning-101
Data Cleaning Libraries with Python
posit-dev/pointblank
Data validation made beautiful and powerful
shopnilsazal/validus
A dead simple Python string validation library.
implerhq/impler.io
Powerful CSV & Excel import experience for SaaS 🚀 Save months of building a data import experience from scratch 💰
MAIF/eurybia
⚓ Eurybia monitors model drift over time and secures model deployment with data validation
bids-standard/legacy-validator
Validator for the Brain Imaging Data Structure
seandstewart/typical
Typical: Fast, simple, & correct data-validation using Python 3 typing.
atrocore/atropim
AtroPIM is a modern, flexible, configurable, open-source product information management system (PIM) of a new generation.
AKSW/RDFUnit
An RDF Unit Testing Suite
datachecks/dcs-core
Open Source Data Quality Monitoring.
medtagger/MedTagger
A collaborative framework for annotating medical datasets using crowdsourcing.
databrickslabs/lakebridge
Accelerates migrations to Databricks by automating key migration activities
fiverr/passable
Declarative data validations.
target/data-validator
A tool to validate data, built around Apache Spark.
serradura/u-attributes
Create "immutable" objects with no setters, just getters.
akmalsoliev/Validoopsie
A simple and easy to use Data Validation library for Python.
Data-Liberation-Front/csvlint.io
Check that your CSV files are valid
argyle-engineering/pydantic2zod
pydantic --> zod data models