Pinned Repositories
deequ
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
data-pipelines-cli
CLI for data platform
deequ
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
data-pipelines-template-example
The project contains an example of a template to create pipeline project with GetInData Framework based on DBT
gitlab_cicd_templates
The project contains templates for CICD processes
cicd_images
Docker images to use with CICD tools
pawelpinkos's Repositories
pawelpinkos/deequ
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.