statistics-for-data-science

There are 48 repositories under statistics-for-data-science topic.

  • Pegah-Ardehkhani/Statistics-and-Probability-in-Python

    A comprehensive exploration of Statistics and Probability Theory concepts, with practical implementations in Python

    Language:Jupyter Notebook1464038
  • deepr

    gagolews/deepr

    Deep R Programming (Open-Access Textbook)

  • suneelpatel/Statistics-for-Data-Science-using-Python

    Using Python, learn statistical and probabilistic approaches to understand and gain insights from data. Learn statistical concepts that are very important to Data science domain and its application using Python. Learn about Numpy, Pandas Data Frame.

    Language:Jupyter Notebook440018
  • SahibzadaSalman/Machine-Learning-with-Python

    This repository contains a gentle introduction to machine learning algorithms with hands on practical examples

    Language:Jupyter Notebook12101
  • Saba-Gul/Exploratory-Data-Analysis-and-Statistical-Analysis-Notebooks

    This repository contains a collection of Jupyter Notebooks for conducting Exploratory Data Analysis (EDA) and Statistical Analysis on various datasets.

    Language:Jupyter Notebook10104
  • sagi778/ufc_fights_Predictor

    Language:Jupyter Notebook5201
  • suneelpatel/Statistics-for-Data-Science-using-R

    Learn the core statistical concepts, followed by application of these concepts using R Studio with the a nice combination of theory and practice. Learn key statistical concepts and techniques like exploratory data analysis, correlation, regression, and inference.

  • vadimtyuryaev/RegrCoeffsExplorer

    A tool for visualizing the coefficients of various regression models, taking into account empirical data distributions.

    Language:R2100
  • dhrupad17/Statistics-and-Probability-for-Data-Science

    This is a repository containing the notes on statistics and probability for Data Science from basics to Advance

  • elizabeth-josephine/WHO-LIFE-EXPECTANCY-PROJECT

    WHO LIFE EXPECTANCY: Studying the factors that affect/contribute to life expectancy and analyzing the changes over the last 15years, that is between 2000-2015.

    Language:Jupyter Notebook10
  • emmaarenas/data-quality-analysis

    collection of Jupyter Notebooks in both English and Spanish, dedicated to performing data quality analysis using the R programming language

    Language:HTML1101
  • Honey28Git/Statistical-Methods-for-Decision-Making

    The Following problems showcase different Statistical Methods used for Decision Making. The purpose of this project is to experiment and execute statistical methods, which are required to conduct data analysis, derive insights and inferences and arrive at business decisions.

    Language:Jupyter Notebook110
  • Marrtinerz/ML-DS-Portfolio

    This portfolio features all the Data Science and Machine Learning projects I have completed for academic, self-learning and hobby purposes. Additionally, it is updated regularly.

    Language:Jupyter Notebook10
  • ragztigadi/Hypothesis-Testing-in-Data-Analysis

    Hypothesis Testing in Data Analysis This repository contains a Jupyter Notebook demonstrating various hypothesis testing techniques using Python. It covers statistical tests such as t-tests, ANOVA, chi-square, and non-parametric methods, with real-world examples and visualizations.

    Language:Jupyter Notebook1
  • AbdylGaniwu/Hypothesis-Testing-on-Heart-Disease-Data-Project

    This project uses statistical hypothesis testing to examine the link between cholesterol and fasting blood sugar levels with heart disease. One-sample t-tests and binomial tests are applied to assess whether these health metrics significantly differ from expected values, focusing on their association with heart disease.

    Language:Jupyter Notebook0100
  • AbdylGaniwu/Poisson-Distribution-Model-for-Analysis

    The Poisson distribution is a useful model for analyzing product defects, helping to estimate expected defect rates, their variability, and the likelihood of extreme cases. This understanding aids in enhancing quality control processes and minimizing defects.

    Language:Jupyter Notebook0100
  • AbdylGaniwu/Utilizing-Poisson-Distribution-for-Retail-Pharmacy-Call-Center-Insights

    The Poisson Distribution models the number of events that occur within a specified time frame, such as years. Since the volume of incoming calls fluctuates from year to year, this distribution aids in determining whether the call data aligns with a Poisson process or if external factors are affecting the call volume.

    Language:Jupyter Notebook0100
  • Immortal-Pi/AdvanceStatistics

    All Statistics concepts

    Language:Jupyter Notebook0100
  • MScDataAnalyticsFirstSemesterAssignmentTwo

    josericodata/MScDataAnalyticsFirstSemesterAssignmentTwo

    Summary of Assignment Two from the first semester of the MSc in Data Analytics program. This repository contains the CA2 assignment guidelines from the college and my submission. To see all original commits and progress, please visit the original repository using the link below.

  • JuanRuizIng/StatisticsCensusSchool

    Welcome to the U.S. Census at School Random Sampler This facility provides random data samples selected from individuals in the U.S. Census at School population that meet your selected characteristics.

    Language:Jupyter Notebook0100
  • sarahibsen/CS499

    This is a desktop Tkinter app that takes in tab delimited data and allows the user to apply statistical analysis on the data. Supports graphing.

    Language:Python00
  • saymuntasir/Applied-Regression-Problem-Set

    This repo contains all two problem set solutions of Applied Regression Course.

    Language:Jupyter Notebook0100
  • BasirS/biomedical-informatics_project

    Statistical time series analysis of hospital data to detect anomalous patterns in patient care metrics, validated against a known incident period.

    Language:R
  • BasLinders/happyhorizon_statstoolkit

    An ongoing project for an online toolkit to analyze online controlled experiments. Its mission: To make inferential statistics accessible for everyone.

    Language:Python113
  • dzhokn/statistics

    Mean, Median, Mode, Variance, Standard Deviation, Range, IQR, Skewness, Kurtosis, Multimodality, Z-score, covariance, correlation, outliers

    Language:Jupyter Notebook
  • GerMati/Subsampled-l-infinity-norm

    Sample Complexity of Bias Detection

    Language:Python
  • jennynzhuang/Regression_Property_Valuation

    Developing a Linear Regression Model for Property Valuation Data

    Language:Jupyter Notebook
  • Jonkkeyler333/EDA_dengue

    Project of data analytics

    Language:HTML1
  • JuanRuizIng/Statistical_distributions

    This repository contains the project for the course Statistics and Probability 1 from the Faculty of Engineering and Basic Sciences. The main focus of the project is the application of probability distributions such as normal, exponential, gamma, Poisson, and binomial distributions to solve real-world problems.

    Language:Jupyter Notebook10
  • Kirstinbosch/data_snoop

    Documenting some of my learning journey on Github Pages

    Language:HTML
  • ManuelCiba/longterm-lsd-ml

    Machine Learning Workflow for Detecting Longterm Drug Effects in In Vitro Neuronal Networks on Microelectrode Arrays (presented at 13th international Meeting on Neural and Electrogenic Cell Interfacing, Vienna, 2025)

    Language:Python
  • meiyulee/solutions_on_DWtest_statistic

    Book of Demythologize Durbin-Watson Test Statistic | Correct the critical values of DW statistic

  • NobuhiroMoteki/Introduction-to-Statistics

    Introductory self-learning text for interval estimation and hypothesis testing, which are the two-central topics in frequentist statistics

    Language:Jupyter Notebook
  • swapnilprasad69/From-Web-to-Wisdom-Scraping-Analyzing-Quote-Data

    This project demonstrates the end-to-end process of extracting meaningful insights from a website using web scraping, performing SQL-based data exploration, and visualizing insights using Python-based Exploratory Data Analysis (EDA).

    Language:Jupyter Notebook
  • tonyamf/Statistical_Analysis_of_Simulated_EEG_Data

    Exploratory data analysis (univariate and bivariate) to understand the characteristics of the EEG signals. To build and compare several multiple linear regression models to predict the output signal y based on the input signals.

    Language:R
  • User10166/ETEC325Library

    Official Python Library for ETEC 325 – Applied Statistics. Published by the Department of Electrical & Computer Engineering Technology – New York Institute of Technology.

    Language:Python