/Building-Statistical-Models-in-Python

Building Statistical Models in Python, Published by Packt

Primary LanguageJupyter NotebookMIT LicenseMIT

Packt Conference

3 Days, 20+ AI Experts, 25+ Workshops and Power Talks

Code: USD75OFF

Building Statistical Models in Python

Building Statistical Models in Python

This is the code repository for Building Statistical Models in Python, published by Packt.

Develop useful models for regression, classification, time series, and survival analysis

What is this book about?

The ability to proficiently perform statistical modeling is a fundamental skill for data scientists and essential for businesses reliant on data insights. Building Statistical Models with Python is a comprehensive guide that will empower you to leverage mathematical and statistical principles in data assessment, understanding, and inference generation.

This book covers the following exciting features:

  • Explore the use of statistics to make decisions under uncertainty
  • Answer questions about data using hypothesis tests
  • Understand the difference between regression and classification models
  • Build models with stats models in Python
  • Analyze time series data and provide forecasts
  • Discover Survival Analysis and the problems it can solve

If you feel this book is for you, get your copy today!

https://www.packtpub.com/

Instructions and Navigations

All of the code is organized into folders.

The code will look like the following:

A = [3,5,4]
B = [43,41,56,78,54]
permutation_testing(A,B,n_iter=10000)

Following is what you need for this book:

If you are looking to get started with building statistical models for your data sets, this book is for you! Building Statistical Models in Python bridges the gap between statistical theory and practical application of Python. Since you’ll take a comprehensive journey through theory and application, no previous knowledge of statistics is required, but some experience with Python will be useful.

With the following software and hardware list you can run all code files present in the book (Chapter 1-14).

Software and Hardware List

Chapter Software required OS required
1-14 Python version ≥ 3.8 Any OS
1-14 Statsmodels 0.13.2 Any OS
1-14 SciPy 1.8.1 Any OS
1-14 lifelines 0.27.4 Any OS
1-14 scikit-learn 1.1.1 Any OS
1-14 pmdarima 2.02 Any OS
1-14 Sktime 0.15.0 Any OS
1-14 Pandas 1.4.3 Any OS
1-14 Matplotlib 3.5.2 Any OS
1-14 Numpy 1.23.0 Any OS

Related products

Get to Know the Author(s)

Huy Hoang Nguyen is a mathematician and data scientist with extensive experience in advanced mathematics, strategic leadership, and applied machine learning research. He holds a PhD in Mathematics, as well as two Master’s degrees in Applied Mathematics and Data Science. His previous work focused on Partial Differential Equations, Functional Analysis, and their applications in Fluid Mechanics. After transitioning from academia to the healthcare industry, he has undertaken a variety of data science projects, ranging from traditional machine learning to deep learning.

Paul Adams is a Data Scientist with a background primarily in the healthcare industry, on both the provider and insurance sides of business. Paul applies statistics and machine learning in multiple areas of industry, focusing on projects in process engineering, process improvement, metrics and business rules development, anomaly detection, forecasting, clustering and classification. Paul holds a Master of Science in Data Science from Southern Methodist University.

Stuart Miller is a Machine Learning Engineer with degrees in Data Science, Electrical Engineering, and Engineering Physics. Stuart has worked at several Fortune 500 companies, including Texas Instruments and StateFarm, where he built software that utilized statistical and machine learning techniques. Stuart is currently an engineer at Toyota Connected helping to build a more modern cockpit experience for drivers using machine learning

Setup Instructions

To install the packages required for this book, enter the following commands in a terminal.

# clone repo 
git clone git@github.com:PacktPublishing/Implementing-Statistical-Modeling-with-Python.git

# navigate to the folder
cd Implementing-Statistical-Modeling-with-Python

# setup python environment 
# (this installs the packages used in the book)
python3 -m venv venv
source venv/bin/activate
pip install -r requirements.txt

Note: Installation instructions for Anaconda are provided in Chapter 1.