dataprofiling

There are 10 repositories under dataprofiling topic.

  • capitalone/DataProfiler

    What's in your data? Extract schema, statistics and entities from datasets

    Language:Python1.4k21181164
  • dataops-testgen

    DataKitchen/dataops-testgen

    DataOps Data Quality TestGen is part of DataKitchen's Open Source Data Observability. DataOps TestGen delivers simple, fast data quality test generation and execution by data profiling,  new dataset hygiene review, AI generation of data quality validation tests, ongoing testing of data refreshes, & continuous anomaly monitoring

    Language:Python48252
  • selva221724/edaSQL

    edaSQL is a python library to bridge the SQL with Exploratory Data Analysis where you can connect to the Database and insert the queries. The query results can be passed to the EDA tool which can give greater insights to the user.

    Language:Python10201
  • SQL-DQC

    martandsingh/SQL-DQC

    SQL based data profiling & data quality checks, which will help you to perform data profiling & data quality checks on SQL database at table & database level.

    Language:TSQL7200
  • atom071/pandas_learning

    Pandas Exercises

    Language:Jupyter Notebook0200
  • kumod007/Data-Profilling

    DATA PROFILING is a process of examining, analyzing, and creating useful summaries of data. The process yields a high-level overview which aids in the discovery of data quality issues, risks, and overall trends.

    Language:Jupyter Notebook0100
  • Ramanthan/DataCleansing_Basics

    Data Cleansing Basics

    Language:Jupyter Notebook0201
  • SanderBos1/profilerInsight

    profilerInsight is a data profiling tool designed to extract and analyze metadata from flat datafiles and different type of databases. This tool is currently under develpment

  • abhay-ak-kulkarni/Sales_Analysis

    sales_analysis

    Language:Jupyter Notebook10
  • kanishksha/sample-data-profile

    customer review on jupyter notebook

    Language:Jupyter Notebook10