data-observability
There are 43 repositories under the data-observability topic.
open-metadata/OpenMetadata
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.
sodadata/soda-core
:zap: Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io
elementary-data/elementary
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
re-data/re-data
re_data - fix data issues before your users & CEO would discover them 😊
opendatadiscovery/odd-platform
First open-source data discovery and observability platform. We make life easy for data practitioners so you can focus on your business.
InfuseAI/piperider
Code review for data in dbt
elementary-data/dbt-data-reliability
dbt package that is part of Elementary, the dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
data-drift/data-drift
Metrics Observability & Troubleshooting
dqops/dqo
Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observability. Configure data quality checks from the UI or in YAML files, and let DQOps run them daily to detect data quality issues.
datachecks/dcs-core
Open Source Data Quality Monitoring.
DataKitchen/data-observability-installer
Installer for DataKitchen's Open Source Data Observability Products. Data breaks. Servers break. Your toolchain breaks. Ensure your team is the first to know and the first to solve with visibility across and down your data estate. Save time with simple, fast data quality test generation and execution. Trust your data, tools, and systems end to end.
re-data/dbt-re-data
re_data - fix data issues before your users & CEO would discover them 😊
Swiple/swiple
Swiple enables you to easily observe, understand, validate and improve the quality of your data
DataKitchen/dataops-testgen
DataOps Data Quality TestGen is part of DataKitchen's Open Source Data Observability. DataOps TestGen delivers simple, fast data quality test generation and execution by data profiling, new dataset hygiene review, AI generation of data quality validation tests, ongoing testing of data refreshes, & continuous anomaly monitoring
sodadata/soda-spark
Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes
oslabs-beta/DataDoc
Endpoint downtime detection, monitoring, and traffic simulation developer tool
DataKitchen/dataops-observability
DataOps Observability is part of DataKitchen's Open Source Data Observability. DataOps Observability monitors every data journey from data source to customer value, from any team development environment into production, across every tool, team, environment, and customer so that problems are detected, localized, and understood immediately.
opendatadiscovery/odd-collector
Open-source metadata collector based on ODD Specification
DataKitchen/dataops-observability-agents
DataOps Observability Integration Agents are part of DataKitchen's Open Source Data Observability. They connect to various ETL, ELT, BI, data science, data visualization, data governance, and data analytic tools. They provide logs, messages, metrics, overall run-time start/stop, subtask status, and scheduling information to DataOps Observability.
montara-io/dbt-command-center
Never sift through endless dbt™ logs again. dbt Command Center is a free, open-source, local web application that provides a user-friendly interface to monitor and manage dbt runs.
open-metadata/openmetadata-site
Open Standard for Metadata. A Single place to Discover, Collaborate and Get your data right.
sodadata/soda-github-action
:zap: Prevent downstream data quality issues by integrating the Soda Library into your CI/CD pipeline.
kiwicom/terraform-provider-montecarlo
This open-source Terraform provider enables users to seamlessly integrate the Monte Carlo data reliability platform into their infrastructure as code (IaC) workflows.
siffletdata/terraform-provider-sifflet
Terraform provider for Sifflet, the data observability platform.
DataBridgeTech/dbqctl
DataBridge Quality Control
dynatrace-oss/dynatrace-snowflake-observability-agent
A tool that streams selected Snowflake telemetry to the Dynatrace API, enabling enhanced data platform observability through Dynatrace dashboards, workflows, and anomaly detection.
cgnorthcutt/reliablity_framework_for_rag
Demo showing how the Trustworthy Language Model adds reliability to LLM outputs and improves RAG, agents, and data enrichment workflows. It can be used to improve fine-tuning of LLMs, accuracy of LLM outputs, and smart routing for RAG and agents.
datasphere-oss/datasphere
DataSphere is the first open-source cloud-native data observability platform that helps you trace the whole data infrastructure in your warehouses, lakes and databases.
opendatadiscovery/odd-collector-gcp
Open-source GCP metadata collector based on ODD Specification
GuinsooLab/stealthward
dbt-native framework built to observe the modern data stack
JBris/openmetadata-test
Testing a Docker deployment of OpenMetadata for S3 data ingestion
annamatias/dataengineer
Trending code, platforms, tools, and processes.
JBris/datahub-test
Testing a Docker deployment of DataHub for S3 data ingestion
JBris/marquez-test
Testing a Docker deployment of Marquez and OpenLineage