exploratory-data-analysis
There are 6562 repositories under exploratory-data-analysis topic.
ydataai/ydata-profiling
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
great-expectations/great_expectations
Always know what to expect from your data.
cleanlab/cleanlab
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
lux-org/lux
Automatically visualize your pandas dataframe via a single print! 📊 💡
evidence-dev/evidence
Business intelligence as code: build fast, interactive data visualizations in SQL and markdown
fbdesignpro/sweetviz
Visualize and compare datasets, target values and associations, with one line of code.
JasonKessler/scattertext
Beautiful visualizations of how language differs among document types.
sfu-db/dataprep
Open-source low code data preparation library in python. Collect, clean and visualization your data in python with a few lines of code.
Renumics/spotlight
Interactively explore unstructured datasets from your dataframe.
cleanlab/cleanvision
Automatically find issues in image datasets and practice data-centric computer vision.
hurshd0/must-read-papers-for-ml
Collection of must read papers for Data Science, or Machine Learning / Deep Learning Engineer
dataprofessor/code
Compilation of R and Python programming codes on the Data Professor YouTube channel.
latitude-dev/latitude
Developer-first embedded analytics
dataprofessor/streamlit_freecodecamp
Build 12 Data Apps in Python with Streamlit
jadianes/data-science-your-way
Ways of doing Data Science Engineering and Machine Learning in R and Python
achuthasubhash/Complete-Life-Cycle-of-a-Data-Science-Project
Complete-Life-Cycle-of-a-Data-Science-Project
tommyod/KDEpy
Kernel Density Estimation in Python
InfuseAI/piperider
Code review for data in dbt
ropensci/visdat
Preliminary Exploratory Visualisation of Data
mstaniak/autoEDA-resources
A list of software and papers related to automatic and fast Exploratory Data Analysis
rasbt/musicmood
A machine learning approach to classify songs by mood.
aeturrell/skimpy
skimpy is a light weight tool that provides summary statistics about variables in data frames within the console.
Desbordante/desbordante-core
Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It also allows to run data cleaning scenarios using these algorithms. Desbordante has a console version and an easy-to-use web application.
data-describe/data-describe
data⎰describe: Pythonic EDA Accelerator for Data Science
rasgointelligence/feature-engineering-tutorials
Data Science Feature Engineering and Selection Tutorials
yangboz/LotteryPrediction
:full_moon_with_face: Lottery prediction besides of following "law of proability","Probability: Independent Events", there are still "Saying "a Tail is due", or "just one more go, my luck is due to change" is called The Gambler's Fallacy" existed.
amanovishnu/iNeuron-Full-Stack-Data-Science-Assignments
This Repository consists of Assignments and projects of the iNeuron Full Stack Data Science Course
neerjad/DataVisualization
Tutorials on visualizing data using python packages like bokeh, plotly, seaborn and igraph
alastairrushworth/inspectdf
🛠️ 📊 Tools for Exploring and Comparing Data Frames
mebauer/data-analysis-using-python
Data Analysis Using Python: A Beginner’s Guide Featuring NYC Open Data.
Jean-njoroge/Breast-cancer-risk-prediction
Classification of Breast Cancer diagnosis Using Support Vector Machines
ank0409/Ditching-Excel-for-Python
Functionalities in Excel translated to Python
harunurrashid97/100-Days-Of-ML-Code
A day to day plan for this challenge. Covers both theoritical and practical aspects
ajaymache/data-analysis-using-python
Exploratory data analysis 📊using python 🐍of used car 🚘 database taken from ⓚ𝖆𝖌𝖌𝖑𝖊
mirador/mirador
Tool for visual exploration of complex data.
dvgodoy/handyspark
HandySpark - bringing pandas-like capabilities to Spark dataframes