/practical-statistics-for-data-scientists

Code repository for O'Reilly book

Primary LanguageJupyter NotebookGNU General Public License v3.0GPL-3.0

<style> table { border: 0px; } td { border: 0px; vertical-align: top; } .inner { max-width: 800px; } b { font-weight: bold; }</style>

Python

Code repository

Practical Statistics for Data Scientists:

50+ Essential Concepts Using R and Python
by Peter Bruce, Andrew Bruce, and Peter Gedeck

Online

View the notebooks online: nbviewer

Excecute the notebooks in Binder: Binder

This can take some time if the binder environment needs to be rebuilt.

Other language versions

English:
Practical Statistics for Data Scientists: 50+ Essential Concepts Using R and Python
2020: ISBN 149207294X
Google books, Amazon
Japanese (2020-06-11):
データサイエンスのための統計学入門 第2版 ―予測、分類、統計モデリング、統計的機械学習とR/Pythonプログラミング
2020: ISBN 978-4-873-11926-7, Shinya Ohashi (supervised), Toshiaki Kurokawa (translated), O'Reilly Japan Inc.
Google books, Amazon, Order here
German (2021-03-29):
Praktische Statistik für Data Scientists: 50+ essenzielle Konzepte mit R und Python 
2021: ISBN 978-3-960-09153-0, Marcus Fraaß (Übersetzer), dpunkt.verlag GmbH
Google books, Amazon Order here
Korean (2021-05-07):
Practical Statistics for Data Scientists: 데이터 과학을 위한 통계(2판)
2021: ISBN 979-1-162-24418-0, Junyong Lee (translation), Hanbit Media, Inc.
Google books, Order here
Polish (2021-06-16):
Statystyka praktyczna w data science. 50 kluczowych zagadnien w jezykach R i Python
2021: ISBN 978-8-328-37427-0, Helion
Google books, Amazon, Order here
Russian (2021-05-31):
Практическая статистика для специалистов Data Science, 2-е изд.
2021: ISBN 978-5-9775-6705-3, BHV St Petersburg
Google books, Order here
Chinese complex (2021-07-29):
Practical Statistics for Data Scientists: 資料科學家的實用統計學 第二版
2021: ISBN 978-9-865-02841-1, Hong Weien (translation), GoTop Information Inc.
Order here
Chinese simplified (2021-10-15):
Practical Statistics for Data Scientists: 数据科学中的实用统计学(第2版)
2021: ISBN 978-7-115-56902-8, Chen Guangxin (translation), Posts & Telecom Press
Order here
English (Indian subcontinent & select countries only):
Practical Statistics for Data Scientists: 50+ Essential Concepts Using R And Python, Second Edition
2021: ISBN 978-8-194-43500-6, Shroff Publishers and Distributors Pvt. Ltd.
Order here
Spanish (2022-02-22):
Estadística práctica para ciencia de datos con R y Python, Second Edition
2022: ISBN 978-8-426-73443-3, Marcombo S.A.
Google books, Amazon, Order here

See also

Setup R and Python environments

R

Run the following commands in R to install all required packages

if (!require(vioplot)) install.packages('vioplot')
if (!require(corrplot)) install.packages('corrplot')
if (!require(gmodels)) install.packages('gmodels')
if (!require(matrixStats)) install.packages('matrixStats')

if (!require(lmPerm)) install.packages('lmPerm')
if (!require(pwr)) install.packages('pwr')

if (!require(FNN)) install.packages('FNN')
if (!require(klaR)) install.packages('klaR')
if (!require(DMwR)) install.packages('DMwR')

if (!require(xgboost)) install.packages('xgboost')

if (!require(ellipse)) install.packages('ellipse')
if (!require(mclust)) install.packages('mclust')
if (!require(ca)) install.packages('ca')

Python

We recommend to use a conda environment to run the Python code.

conda create -n sfds python
conda activate sfds
conda env update -n sfds -f environment.yml