data-exploration
There are 622 repositories under data-exploration topic.
Kanaries/pygwalker
PyGWalker: Turn your dataframe into an interactive UI for visual analysis
ydataai/ydata-profiling
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
Kanaries/Rath
Next generation of automated data exploratory analysis and visualization platform.
fbdesignpro/sweetviz
Visualize and compare datasets, target values and associations, with one line of code.
sfu-db/dataprep
Open-source low code data preparation library in python. Collect, clean and visualization your data in python with a few lines of code.
hi-primus/optimus
:truck: Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
opendatadiscovery/odd-platform
First open-source data discovery and observability platform. We make a life for data practitioners easy so you can focus on your business.
cleanlab/cleanvision
Automatically find issues in image datasets and practice data-centric computer vision.
comet-ml/kangas
🦘 Explore multimedia datasets at scale
abhayspawar/featexp
Feature exploration for supervised learning
keen/explorer
Data Explorer by Keen - point-and-click interface for analyzing and visualizing event data.
boxuancui/DataExplorer
Automate Data Exploration and Treatment
polyaxon/traceml
Engine for ML/Data tracking, visualization, explainability, drift detection, and dashboards for Polyaxon.
InfuseAI/piperider
Code review for data in dbt
Puchaczov/Musoq
SQL Syntax without any database
Desbordante/desbordante-core
Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It also allows to run data cleaning scenarios using these algorithms. Desbordante has a console version and an easy-to-use web application.
panel-extensions/panel-graphic-walker
A project providing a Graphic Walker Pane for use with HoloViz Panel.
rolkra/explore
R package that makes basic data exploration radically simple (interactive data exploration, reproducible data science)
tkrabel/edaviz
edaviz - Python library for Exploratory Data Analysis and Visualization in Jupyter Notebook or Jupyter Lab
grafana-toolbox/grafana-wtf
Grep through all Grafana entities in the spirit of git-wtf.
tvdboom/ATOM
Automated Tool for Optimized Modelling
federicomarini/awesome-expression-browser
😎 A curated list of software and resources for exploring and visualizing (browsing) expression data 😎
virajbhutada/bi-projects-collection
Discover a curated collection of dynamic Power BI dashboards covering financial analytics, HR metrics, streaming service trends, real estate dynamics, and more. Meticulously designed for comprehensive data exploration, this repository continues to expand with new and impactful visualizations.
ObservedObserver/pivot-chart
light and fast implementation of web pivot table / pivot chart components.
facultyai/lens
Summarise and explore Pandas DataFrames
federicomarini/GeneTonic
Enjoy your transcriptomic data and analysis responsibly - like sipping a cocktail
Subgin/tonic
🍸 Digital Collections Framework
Renumics/sliceguard
A library for detecting problematic data segments in structured and unstructured data with few lines of code.
yaph/ipython-notebooks
A collection of Jupyter notebooks exploring different datasets.
DistrictDataLabs/cultivar
Multidimensional data explorer and visualization tool.
evoluteur/kaggle-look-alike
Kaggle Data Explorer UI look-alike built in React.
debiai/DebiAI
Bias detection and contextual evaluation tool for your AI projects
afraniomelo/KydLIB
Routines for exploratory data analysis.
PsyChiLin/EFAshiny
An User-Friendly Application for Exploratory Factor Analysis
NikhilaThota/CapstoneProject_House_Prices_Prediction
Understand the relationships between various features in relation with the sale price of a house using exploratory data analysis and statistical analysis. Applied ML algorithms such as Multiple Linear Regression, Ridge Regression and Lasso Regression in combination with cross validation. Performed parameter tuning, compared the test scores and suggested a best model to predict the final sale price of a house. Seaborn is used to plot graphs and scikit learn package is used for statistical analysis.
SouGuit/Zomato_Dataset_Analysis
Zomato Data Exploration and Analysis with SQL (SQL SERVER)