data-exploration

There are 622 repositories under the data-exploration topic.

  • Kanaries/pygwalker

    PyGWalker: Turn your dataframe into an interactive UI for visual analysis

    Language:Python15.2k91246819
  • ydataai/ydata-profiling

    One line of code for data quality profiling and exploratory data analysis of Pandas and Spark DataFrames.

    Language:Python13.1k1508481.7k
  • Kanaries/Rath

    Next generation of automated data exploratory analysis and visualization platform.

    Language:TypeScript4.5k45150369
  • fbdesignpro/sweetviz

    Visualize and compare datasets, target values and associations, with one line of code.

    Language:Python3k53143288
  • sfu-db/dataprep

    Open-source low-code data preparation library in Python. Collect, clean, and visualize your data in Python with a few lines of code.

    Language:Python2.2k26417219
  • hi-primus/optimus

    🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark

    Language:Python1.5k36219233
  • opendatadiscovery/odd-platform

    The first open-source data discovery and observability platform. We make life easy for data practitioners so you can focus on your business.

    Language:Java1.3k18645128
  • cleanlab/cleanvision

    Automatically find issues in image datasets and practice data-centric computer vision.

    Language:Python1.1k168575
  • comet-ml/kangas

    🦘 Explore multimedia datasets at scale

    Language:Jupyter Notebook1.1k151652
  • abhayspawar/featexp

    Feature exploration for supervised learning

    Language:Jupyter Notebook7622123163
  • keen/explorer

    Data Explorer by Keen - point-and-click interface for analyzing and visualizing event data.

    Language:TypeScript7484614760
  • boxuancui/DataExplorer

    Automate Data Exploration and Treatment

    Language:R5303417491
  • polyaxon/traceml

    Engine for ML/Data tracking, visualization, explainability, drift detection, and dashboards for Polyaxon.

    Language:Python520121446
  • InfuseAI/piperider

    Code review for data in dbt

    Language:Python490127524
  • Puchaczov/Musoq

    SQL Syntax without any database

    Language:C#48881721
  • Desbordante/desbordante-core

    Desbordante is a high-performance data profiler that can discover many different patterns in data using various algorithms. It also lets you run data-cleaning scenarios using these algorithms. Desbordante has a console version and an easy-to-use web application.

    Language:C++41997980
  • panel-extensions/panel-graphic-walker

    A project providing a Graphic Walker Pane for use with HoloViz Panel.

    Language:Python32782211
  • rolkra/explore

    R package that makes basic data exploration radically simple (interactive data exploration, reproducible data science)

    Language:R24092826
  • tkrabel/edaviz

    edaviz - Python library for Exploratory Data Analysis and Visualization in Jupyter Notebook or Jupyter Lab

    Language:Python22617324
  • grafana-toolbox/grafana-wtf

    Grep through all Grafana entities in the spirit of git-wtf.

    Language:Python20024920
  • tvdboom/ATOM

    Automated Tool for Optimized Modelling

    Language:HTML15931814
  • federicomarini/awesome-expression-browser

    😎 A curated list of software and resources for exploring and visualizing (browsing) expression data 😎

  • virajbhutada/bi-projects-collection

    Discover a curated collection of dynamic Power BI dashboards covering financial analytics, HR metrics, streaming service trends, real estate dynamics, and more. Meticulously designed for comprehensive data exploration, this repository continues to expand with new and impactful visualizations.

  • ObservedObserver/pivot-chart

    Light and fast implementation of web pivot table / pivot chart components.

    Language:TypeScript1035225
  • facultyai/lens

    Summarise and explore Pandas DataFrames

    Language:Python9814198
  • federicomarini/GeneTonic

    Enjoy your transcriptomic data and analysis responsibly - like sipping a cocktail

    Language:R7842711
  • Subgin/tonic

    🍸 Digital Collections Framework

    Language:HTML713200
  • Renumics/sliceguard

    A library for detecting problematic data segments in structured and unstructured data with few lines of code.

    Language:Python64523
  • yaph/ipython-notebooks

    A collection of Jupyter notebooks exploring different datasets.

    Language:Jupyter Notebook577123
  • DistrictDataLabs/cultivar

    Multidimensional data explorer and visualization tool.

    Language:HTML56257418
  • evoluteur/kaggle-look-alike

    Kaggle Data Explorer UI look-alike built in React.

    Language:JavaScript36203
  • debiai/DebiAI

    Bias detection and contextual evaluation tool for your AI projects

    Language:Vue2851475
  • afraniomelo/KydLIB

    Routines for exploratory data analysis.

    Language:Python27104
  • PsyChiLin/EFAshiny

    A User-Friendly Application for Exploratory Factor Analysis

    Language:R272813
  • NikhilaThota/CapstoneProject_House_Prices_Prediction

    Understand the relationships between various features and the sale price of a house using exploratory data analysis and statistical analysis. Applied ML algorithms such as Multiple Linear Regression, Ridge Regression and Lasso Regression in combination with cross-validation. Performed parameter tuning, compared the test scores and suggested the best model to predict the final sale price of a house. Seaborn is used to plot graphs and the scikit-learn package is used for statistical analysis.

    Language:Jupyter Notebook260013
  • SouGuit/Zomato_Dataset_Analysis

    Zomato Data Exploration and Analysis with SQL (SQL SERVER)

    Language:TSQL26117