exploratory-data-analysis-eda

There are 40 repositories under exploratory-data-analysis-eda topic.

  • datamole-ai/edvart

    An open-source Python library for Data Scientists & Data Analysts designed to simplify the exploratory data analysis process. Using Edvart, you can explore data sets and generate reports with minimal coding.

    Language:Python592628
  • MelihGulum/Comprehensive-Data-Science-AI-Project-Portfolio

    A curated collection of AI, data engineering, and DevOps projects featuring real-world applications, advanced techniques, and tutorials—ideal for learners and practitioners exploring data science and machine learning.

    Language:Jupyter Notebook59108
  • SarangGami/Capstone-EDA-project-Airbnb-bookings-analysis

    Exploratory data analysis of Airbnb bookings in New York City to gain insights into the travel industries and Uncovers trends, patterns, user preferences and behavior. Utilizes Python libraries for data exploration, data cleaning, manipulation, and visualization. Provides valuable insights for travelers, hosts, and the Airbnb business.

    Language:Jupyter Notebook412030
  • OzzyGoylusun/Python.-Exploration-of-All-Nobel-Prize-Winners-and-Intriguing-Hidden-Patterns

    This data project on Python uncovers and visualizes untapped patterns regarding all Nobel Prize laureates up to date.

    Language:Jupyter Notebook7200
  • Akash1070/Deploying-a-Netflix-Recommender-System-on-Heroku-Cloud

    Building And Deploying A Netflix Recommender System On Heruko

    Language:HTML5201
  • Mashael2030/Analysis-of-E-commerce-for-women-s-clothing-EDA

    # Women's E-Commerce Clothing Reviews Data Analysis

    Language:Jupyter Notebook5200
  • anna-kay/Reddit-summarization

    Abstractive summarization of Reddit datasets with Transformers.

    Language:Jupyter Notebook4100
  • gopiashokan/Airbnb-Analysis-with-Tableau

    Built an interactive Tableau dashboard to analyze Airbnb data and developed a Streamlit application for trend analysis, pattern recognition, and data insights using EDA. Explored variations in price, location, property type, and seasons with interactive plots and charts, greatly aiding decision-making in the hospitality and real estate industries.

    Language:Jupyter Notebook4112
  • JonathaWRDCosta/Heart-Disease-ML-Project

    This repository contains a Jupyter Notebook demonstrating a practical example of data science and machine learning for heart disease classification.

    Language:Jupyter Notebook3200
  • thisisrawabi/Exploratory-Data-Analysis-EDA-

    NYC health is one of the well-known centers in New York City to offer PCR tests for COVID-19 the center decided to establish ten mini examination centers in MTA stations. Thus NYC health is now in a mission to find the most crowded stations in New York City based on analyzing the MTA stations dataset which will give a better understanding of the movements inside the stations and the persona.

    Language:Python3200
  • ishfaqkhan-dev/Pandas-Practice

    Pandas practice notebooks for data analysis.

    Language:Python2
  • jianninapinto/Coffee-Shops-Review-Analysis-using-NLP

    Performed feature engineering and data cleaning on text data using lemmatization techniques and stop word removals.

    Language:Jupyter Notebook2100
  • adilrasheed139/AI-Powered-Resume-Screening-using-BERT

    Successfully developed a resume classification model which can accurately classify the resume of any person into its corresponding job with a tremendously high accuracy of more than 99%.

    Language:Jupyter Notebook1100
  • Ehsan-Behzadi/Online-Retail-Data-Analysis-and-Preprocessing

    This project analyzes and preprocesses the Online Retail dataset to uncover insights into customer purchasing behaviors, sales trends, and product performance. It includes data cleaning, exploration, and visualization, with the goal of enhancing understanding of online retail dynamics.

    Language:Jupyter Notebook1100
  • itsmeamitesh01/E-commerce-SQL-Analysis

    A diagnostic and exploratory analysis (EDA) of the Olist dataset in BigQuery. This project transforms raw data into strategic insights on customer satisfaction, product performance, and operational efficiency.

  • JonathaWRDCosta/Dog-Breed-Identification

    Determine the breed of a dog in an image

    Language:Jupyter Notebook1100
  • KarthikUdyawar/passwordometer

    To predict the strength of the password

    Language:Jupyter Notebook1100
  • Md-Emon-Hasan/3-Eda-Basketball-ML-App

    A ML application focused on EDA and basketball analytics, showcasing data visualization and insights using Python and relevant libraries.

    Language:Python1110
  • niladrighosh03/Classification---Comparison-of-Supervised-Machine-Learning-Algorithms

    This project explores supervised machine learning algorithms for heart disease prediction using the UCI Heart Disease Dataset. Various classification models like KNN, SVM, Logistic Regression, Decision Trees, Random Forest, Naïve Bayes, Gradient Boosting, and XGBoost are implemented and compared based on accuracy, precision, recall, and F1-score.

    Language:Jupyter Notebook110
  • Rahulaggl/EDA

    This project offers an Exploratory Data Analysis (EDA) on company stakeholders, including management, employees, shareholders, and others. Conducted in Python via Google Colab, it covers data transformation, clustering, statistical analysis, PCA, and predictive modeling. Visualizations provide insights into stakeholder roles and influence.

    Language:Jupyter Notebook1100
  • sandesha21/FoodHub

    Exploratory data analysis of a food delivery aggregator dataset to identify order trends, restaurant performance, and delivery efficiency. Provides actionable insights for operations and marketing.

    Language:Jupyter Notebook1
  • Shamir-Havas/Automobile-Sales-Analysis-Dashboard

    🚗 Built an interactive Automobile Sales Dashboard using Python & Plotly Dash. Features dynamic filters, multi-chart views, and KPIs to analyze sales by year, region, and vehicle type. Demonstrates data wrangling, visualization, and dashboarding skills for business insights.

    Language:Jupyter Notebook1
  • gyan-insights/Uber_Mentornship

    This project is part of a Mentornship focused on solving real-world traffic congestion challenges faced by urban cities. Using Uber traffic data, the goal was to predict traffic volume across multiple junctions by identifying peak traffic hours, comparing congestion patterns between junctions, and building robust predictive models.

    Language:Jupyter Notebook00
  • US-Household-Income

    Jc-analyst/US-Household-Income

    SQL-based analysis of U.S. household income trends using the Analyst Builder project. This repository explores income distribution across states and counties, featuring data cleaning, schema design, and exploratory queries for socioeconomic insights.

    00
  • JosephHinga/Airbnb-listing-New-York

    EDA of NYC airbnb listings exploring pricing, room types, host behavior and insights

    Language:Jupyter Notebook0000
  • Josshua-DSA/IPM-Analysis-Rstudio

    Statistical analysis of Indonesian Human Development Index (2021) using OLS regression, Best Subset Selection, and Stepwise methods to identify key factors influencing life expectancy across regions.

    00
  • Tolumie/Exploratory-Data-Analytics-Projects

    Exploratory Data Analytics – A collection of projects covering data exploration, feature engineering, hypothesis testing, and predictive modeling across diverse datasets, including insurance, real estate, laptops, cars, COVID-19, and the Olympics.

    Language:Jupyter Notebook00
  • netebom/portfolio

    A curated collection of data analytics and visualization projects using Python, SQL, Power BI, and real-world datasets. Each project demonstrates end-to-end analysis, from data cleaning and EDA to modeling and insights for business and public impact.

  • pritamgold/EDA-Analysis

    Exploratory Data Analysis (EDA) and machine learning investigation into the impact of commute time and social media usage on mental well-being. It identifies key factors like sleep patterns influencing mental health and academic outcomes, leveraging data preprocessing and various machine learning algorithms.

    Language:Jupyter Notebook
  • Rivu5555/House-Regression

    Comprehensive exploratory data analysis of Kaggle's House Prices dataset using Python, pandas, seaborn, and matplotlib. Uncovers pricing patterns, feature relationships, and data insights through visualizations.

    Language:Jupyter Notebook
  • shrivishalinirajaram/eda_in_R

    Exploratory data analysis of a simulated dataset and interpretation

    Language:HTML
  • sreesudhacivil-oss/PREDICTIVE-ANALYTICS-FORECASTING-PROJECT

    R Markdown project for analyzing telecom customer churn using EDA and machine learning models.

    Language:HTML
  • SrinidhiJai/Cafe-Sales_Data-Cleaning-EDA

    Data cleaning and EDA project using real-world cafe sales & transaction data

    Language:Jupyter Notebook
  • techwithhams/Northwind-Sales-Analysis

    Customer segmentation, sales trends, and supplier insights using SQLite + Power BI based on the Northwind dataset.