statistics-for-data-science
There are 48 repositories under statistics-for-data-science topic.
Pegah-Ardehkhani/Statistics-and-Probability-in-Python
A comprehensive exploration of Statistics and Probability Theory concepts, with practical implementations in Python
gagolews/deepr
Deep R Programming (Open-Access Textbook)
suneelpatel/Statistics-for-Data-Science-using-Python
Using Python, learn statistical and probabilistic approaches to understand and gain insights from data. Learn statistical concepts that are very important to Data science domain and its application using Python. Learn about Numpy, Pandas Data Frame.
SahibzadaSalman/Machine-Learning-with-Python
This repository contains a gentle introduction to machine learning algorithms with hands on practical examples
Saba-Gul/Exploratory-Data-Analysis-and-Statistical-Analysis-Notebooks
This repository contains a collection of Jupyter Notebooks for conducting Exploratory Data Analysis (EDA) and Statistical Analysis on various datasets.
suneelpatel/Statistics-for-Data-Science-using-R
Learn the core statistical concepts, followed by application of these concepts using R Studio with the a nice combination of theory and practice. Learn key statistical concepts and techniques like exploratory data analysis, correlation, regression, and inference.
vadimtyuryaev/RegrCoeffsExplorer
A tool for visualizing the coefficients of various regression models, taking into account empirical data distributions.
dhrupad17/Statistics-and-Probability-for-Data-Science
This is a repository containing the notes on statistics and probability for Data Science from basics to Advance
elizabeth-josephine/WHO-LIFE-EXPECTANCY-PROJECT
WHO LIFE EXPECTANCY: Studying the factors that affect/contribute to life expectancy and analyzing the changes over the last 15years, that is between 2000-2015.
emmaarenas/data-quality-analysis
collection of Jupyter Notebooks in both English and Spanish, dedicated to performing data quality analysis using the R programming language
Honey28Git/Statistical-Methods-for-Decision-Making
The Following problems showcase different Statistical Methods used for Decision Making. The purpose of this project is to experiment and execute statistical methods, which are required to conduct data analysis, derive insights and inferences and arrive at business decisions.
Marrtinerz/ML-DS-Portfolio
This portfolio features all the Data Science and Machine Learning projects I have completed for academic, self-learning and hobby purposes. Additionally, it is updated regularly.
ragztigadi/Hypothesis-Testing-in-Data-Analysis
Hypothesis Testing in Data Analysis This repository contains a Jupyter Notebook demonstrating various hypothesis testing techniques using Python. It covers statistical tests such as t-tests, ANOVA, chi-square, and non-parametric methods, with real-world examples and visualizations.
AbdylGaniwu/Hypothesis-Testing-on-Heart-Disease-Data-Project
This project uses statistical hypothesis testing to examine the link between cholesterol and fasting blood sugar levels with heart disease. One-sample t-tests and binomial tests are applied to assess whether these health metrics significantly differ from expected values, focusing on their association with heart disease.
AbdylGaniwu/Poisson-Distribution-Model-for-Analysis
The Poisson distribution is a useful model for analyzing product defects, helping to estimate expected defect rates, their variability, and the likelihood of extreme cases. This understanding aids in enhancing quality control processes and minimizing defects.
AbdylGaniwu/Utilizing-Poisson-Distribution-for-Retail-Pharmacy-Call-Center-Insights
The Poisson Distribution models the number of events that occur within a specified time frame, such as years. Since the volume of incoming calls fluctuates from year to year, this distribution aids in determining whether the call data aligns with a Poisson process or if external factors are affecting the call volume.
Immortal-Pi/AdvanceStatistics
All Statistics concepts
josericodata/MScDataAnalyticsFirstSemesterAssignmentTwo
Summary of Assignment Two from the first semester of the MSc in Data Analytics program. This repository contains the CA2 assignment guidelines from the college and my submission. To see all original commits and progress, please visit the original repository using the link below.
JuanRuizIng/StatisticsCensusSchool
Welcome to the U.S. Census at School Random Sampler This facility provides random data samples selected from individuals in the U.S. Census at School population that meet your selected characteristics.
sarahibsen/CS499
This is a desktop Tkinter app that takes in tab delimited data and allows the user to apply statistical analysis on the data. Supports graphing.
saymuntasir/Applied-Regression-Problem-Set
This repo contains all two problem set solutions of Applied Regression Course.
BasirS/biomedical-informatics_project
Statistical time series analysis of hospital data to detect anomalous patterns in patient care metrics, validated against a known incident period.
BasLinders/happyhorizon_statstoolkit
An ongoing project for an online toolkit to analyze online controlled experiments. Its mission: To make inferential statistics accessible for everyone.
dzhokn/statistics
Mean, Median, Mode, Variance, Standard Deviation, Range, IQR, Skewness, Kurtosis, Multimodality, Z-score, covariance, correlation, outliers
GerMati/Subsampled-l-infinity-norm
Sample Complexity of Bias Detection
jennynzhuang/Regression_Property_Valuation
Developing a Linear Regression Model for Property Valuation Data
Jonkkeyler333/EDA_dengue
Project of data analytics
JuanRuizIng/Statistical_distributions
This repository contains the project for the course Statistics and Probability 1 from the Faculty of Engineering and Basic Sciences. The main focus of the project is the application of probability distributions such as normal, exponential, gamma, Poisson, and binomial distributions to solve real-world problems.
Kirstinbosch/data_snoop
Documenting some of my learning journey on Github Pages
ManuelCiba/longterm-lsd-ml
Machine Learning Workflow for Detecting Longterm Drug Effects in In Vitro Neuronal Networks on Microelectrode Arrays (presented at 13th international Meeting on Neural and Electrogenic Cell Interfacing, Vienna, 2025)
meiyulee/solutions_on_DWtest_statistic
Book of Demythologize Durbin-Watson Test Statistic | Correct the critical values of DW statistic
NobuhiroMoteki/Introduction-to-Statistics
Introductory self-learning text for interval estimation and hypothesis testing, which are the two-central topics in frequentist statistics
swapnilprasad69/From-Web-to-Wisdom-Scraping-Analyzing-Quote-Data
This project demonstrates the end-to-end process of extracting meaningful insights from a website using web scraping, performing SQL-based data exploration, and visualizing insights using Python-based Exploratory Data Analysis (EDA).
tonyamf/Statistical_Analysis_of_Simulated_EEG_Data
Exploratory data analysis (univariate and bivariate) to understand the characteristics of the EEG signals. To build and compare several multiple linear regression models to predict the output signal y based on the input signals.
User10166/ETEC325Library
Official Python Library for ETEC 325 – Applied Statistics. Published by the Department of Electrical & Computer Engineering Technology – New York Institute of Technology.