Probability and Statistics Reading List

A compilation of books for learning probability and statistics from scratch.

Some notes

This repository contains an analysis of the commonly recommended books for learning probability and statistics from around the web.

One thing to keep in mind is that advanced mathematics in areas like analysis and measure theory might become more important as we delve deeper into topics. Or may be even before that since things are deeply intertwined all across math. Some of the close adjacent areas to explore after this seems to be information theory, and statistical mechanics/thermodynamics as they can be seen commonly sprucing up in books addressing the topics of probability that I have examined here. I have attempt to put the link to resources directly to those that are freely available and included Amazon affiliate links to others. I have also attempt to link to the places where the resource was recommended from.

TODO: Tag the resources that are free using an icon/tag

On the Categorization

The books have been organized into 3 sections, that are ordered as: Introductory → Elementary → Rigorous. The various books I have sampled are categorized into these three sections and within these sections, the books are unordered, which means, I don’t recommend any sequence of reading them inside these broader categories.

Apart from these, as adjuncts, I have also added some history books and topics that are adjacent to probability and statistics one might enjoy learning.

Within the Elementary section, I have tried to categorize the books along 2 axes, { logical, empirical } × { visual, symbolic }.

By a symbolic vs. visual approach, I intend to cover those which take a rigorous algebraic approach vs. ones that lean in on geometric/structural intuition of the concepts at play. These are not mutually exclusive categories and books are situated in a spectrum. With logical vs. empirical approach, I intend to locate those books that have an inclination towards describing the underlying formal patterns as deriving from a rational understanding of the space vs. a more data oriented one where case studies from real world are presented and inferences are inducted from this space. Whenever there was an inclination towards either poles, I have done a rounding off and put it off to align with one of the poles. So even when reading a book classified as say, logical/visual, there is good chance that it will have a good amount of symbolic knowledge to teach you.

For statistics, I have tried to mention whether the books take a frequentist or Bayesian approach with the same caveat of mutual inclusion of ideas as above.

Even though, this is a simplistic division and fails to capture the actual complexity of concepts in probability and statistics, it is the best provisional axes I could supply to make sense of these fields. Once my comprehension of the landscape improves, I will try to adapt/revise the bases to have better span and decomposition properties accordingly.

TODO: May be include a 2 × 2 of the book distribution?

TODO: After listing this out, I have felt that may be a better categorization would be { conceptual, computational } × { probability, statistics }. This would have the books that give a conceptual grounding primarily vs. one that has a lot of exercises to solve which would help the reader arrive at an understanding for the relations at play. Try to do this after you put up the concept lattice for the chapters in the second or third pass.

Index

Introductory
Probability Theory: First Steps
Lady Luck
The Lady Tasting Tea
The Theory that Would not Die
The Art of Statistics
Naked Statistics
Logical/VisualLogical/SymbolicEmpirical/VisualEmpirical/Symbolic
Probability Theory (for Scientists and Engineers)Probability Theory: The Logic of ScienceThe World is Built on ProbabilityUnderstanding Uncertainty
Probability Theory: First StepsProbability Theory for An Enthusiastic BeginnerPeter Norvig Python NotebooksProbability via Expectation
Conditional Probability Explained VisuallyIntroduction to ProbabilityBayes Theorem: A Visual Introduction for BeginnersPrinciples of Statistical Inference
An Introduction to Probability and Random ProcessesSeeing TheoryAll of Statistics
Causal Inference: A PrimerTheory of Probability
Discrete ProbabilityStatistical Rethinking
Think Stats
OpenIntro Statistics
An Introduction to Statistical Learning: With Applications in R
The Elements of Statistical Learning
Statistics
Introduction to Probability and Statistics for Engineers and Scientists
Grinstead and Snell’s Introduction to Probability
Probability and Statistics
Think Bayes
Understanding Probability
Blitzstein and Hwang’s Introduction to Probability
Rigorous works
Introduction to Probability Theory
Probability and Measure
Probability
A Course in Probability Theory
A Probability Path
Measure Theory and Probability Theory
Probability with Martingales
Probability: Theory and Examples
The Foundations of Statistics
Probability Theory and Stochastic Processes With Applications
Probability Theory: A Comprehensive Course
Probability and Random Processes
History
Taming of Chance
The Empire of Chance
The Rise of Statistical Thinking - 1820 – 1900
#chance-logic-and-intuition
Additional reads
Against the Gods
The (Mis)Behaviour of Markets
Information Theory, Inference, and Learning Algorithms
Advanced Data Analytics from an Elementary Point of View

Overview of topics covered

TODO: Add in later by codifying the data on how different books treat different topics.

Introductory Works

These are the introductory works into the topic. The books here are popular introductory works into probability which doesn’t get down into the nitty gritty and as such reading these should be supplemented with more rigorous works. They are added here so that an interested reader who is totally new to this domain can build context and familiarize themselves with the central ideas of this field. This also includes some computational notebooks by Peter Norvig which could be of great help in trying to get hands dirty with the algorithms that one uses when trying to navigate the landscape of probability and statistics.

Cover image for Lady Luck

Warren Weaver

1982

400 pages

An introduction to probability emphasizing the history of the subject.

Cover image for The Lady Tasting the Tea

David Salsburg

  • April 1, 2001
  • 352 pages

Subtitle: How Bayes’ Rule Cracked the Enigma Code, Hunted Down Russian Submarines and Emerged Triumphant from Two Centuries of Controversy

Cover for The Theory that would not die

Sharon Bertsch McGrayne

2012

360 pages

A talk based on this book is available here: https://www.youtube.com/watch?v=8oD6eBkjF9o

./img/the-theory-that-would-not-die-video.jpg

The book describes the contest between frequentist and Bayesian approaches. It has less mathematics and computation using the mathematical concepts and is rather narrative oriented about how the different ideas panned out.

Subtitle: How to Learn from Data

David Spiegelhalter

2020

Cover image for The Art of Statistics

An easy introduction into the world of statistics

Charles Wheelan

2014

Cover image for Naked Statistics

A narrative driven account of the field of statistics.

Logical / Visual

./img/probability-theory-for-scientists-and-engineers.png

Michael Betancourt

October 2018

An online book that provides visual intuition into the ideas of probability along with a good ground work for the mathematical symbolic language that undergirds modern probability theory. The topics are touched upon in a rather cursory manner and might need the support of some other books to thoroughly unravel the underpinnings.

There is also a follow up book from here under Probabilistic Modeling and Statistical Inference

Cover for Probability Theory: The First Steps

An introduction to probability theory in popular language

./img/conditional-probability-explained-visually.png

Victor Powell

2014

Blog post

A neat visualization of conditional probability by Victor Powell

Logical / Symbolic

These are roughly the works in probability with a symbolic bent or works in statistics with a frequentist approach.

Cover of Probability Theory: The Logic of Science

E. T. Jaynes

2003

A Bayesian centric approach on interpreting probability as propositions about reality.

This book was compiled from a posthumous manuscript by the editor Larry Bretthorst.

Cover of Probability for the Enthusiastic Beginner

David Morin

2016

371 pages

A book that attempts to build on the intuition. Less of proving theorems rigorously and there is a combinatorial chapter in the beginning which for building a base in combinatorics.

Cover image for Introduction to Probability

Dimitri Bertsekas, John Tsitsiklis

June 24, 2002

430 pages

When considering the dimensions between intuition and rigour, this book provides ample intuition to the ideas in probability. It is also supported by some good exercises to work through.

Cover of An Introduction to Probability and Random Processes

Gian-Carlo Rota, Kenneth Baclawski

An introduction to probability from combinatorialist Rota and data scientist Baclawski based on the lecture notes for the course at MIT. It goes from the elementary concepts of probability and statistics and has a thermodynamics/information theory bend towards the end.

Cover of Causal Inference in Statistics A Primer

Judea Pearl, Madelyn Glymour, Nicholas P. Jewell

160 pages

2016

Might be a nice book to start reading after The Book of Why to get into some of the nitty gritty on inference from data. There seems also to be a more rigorous work on Causality by Pearl in Causality: Models, Reasoning, and Inference

Hugh Gordon

2012

Cover of Discrete Probability

A book explaining discrete probability in an accessible language.

Empirical / Symbolic

These are roughly the works with a Bayesian / Data centric bent which focusses on a symbolic approach. The more rigorous works in studies can also be seen in this section as visual ideas haven’t matured to capture the rigorous nature of mathematical machinery employed to make sense of the ideas in this field.

Dennis V. Lindley

1st Edition (2006)

Cover of Understanding Uncertainty

An introductory book that gives a conceptual grounding for the ideas in probability and statistics.

Peter Whittle

1992

Cover of Probability via Expectation

Frequentist introduction to probability that takes an abstract approach.

David R. Cox

2006

Cover of Principles of Statistical Inference

Frequentist introduction to statistical inference giving a comparison of various approaches.

Cover of All of Statistics

Larry Wasserman

An introductory book that takes a rigorous approach towards introducing the concepts. Might not be the most apt book to start for learning statistics from scratch, but once you are confident about the basics, this is highly recommended as a book to learn the elements of statistics.

Bruno de Finetti

1974

Cover of Theory of Probability

A Bayesian approach on probability as accounting for consequences of decisions made.

Subtitle: A Bayesian course with examples in R and STAN

Richard McElreath

2020

Cover of Statistical Rethinking

A computational approach to Bayesian statistics. Not theoretically demanding as Gelman’s Bayesian Data Analysis, and helps a mathematical novice to see their way around the computational processes underpinning Bayesian statistics.

Allen B. Downey

2011

Cover of Think Stats

A book that teaches statistics by programming through Python

David Diez, Mike Çetinkaya-Rundel, Christopher Barr

2019

Cover of Open Intro Statistics

An open source text book for learning statistics along with supporting video lectures and labs.

Gareth James, Daniela Witten, Trevor Hastie, Robert Tibshirani

2013

Cover of An Introduction to Statistical Learning

Less background required than The Elements of Statistical Learning

Subtitle: Data Mining, Inference, Prediction

Trevor Hastie, Robert Tibshirani, Jerome Friedman

1st published: 2001, 2nd Edition: 2009

Cover of The Elements of Statistical Learning

A thorough book giving sound overview of the fundamentals of statistics.

Cover of Statistics

David Freedman, Robert Pisani, Roger Purves

4th Edition (2007)

A book directed towards people with minimal mathematics exposure. The organization of the book in helping build the intuition gradually is remarked by people have read it.

Sheldon Ross

First Edition: 1987, 6th Edition: 2020

Cover of Statistics

Used as a common textbook in many universities.

Charles M. Grinstead, J. Laurie Snell

1997

528 pages

Cover of Statistics

An open source text book to learn probability that takes a calculus based approach instead of combinatorial one.

Morris DeGroot, Mark Schervish

First Edition: 1975, Fourth Edition: 2013

Cover of Probability and Statistics

A highly recommended book for learning both probability and statistics.

Allen Downey

Cover of Think Bayes

2012

An introduction to Bayesian probability in the style of Think Stats

Henk Tijms

2012

574 pages

Cover of Understanding Probability

The book is divide into two parts with the first part giving an intuition for the concepts involved and the second giving the subject a more formal treatment.

Joseph K. Blitzstein, Jessica Hwang

1st Edition: 2014, 2nd Edition: 2019

Cover of Introduction to Probability

A highly recommended book to learn probability from. Comes with an accompanying MOOC: https://projects.iq.harvard.edu/stat110/home

Empirical / Visual

These are roughly the works with a Bayesian / Data centric bent which focusses on a visual approach.

Cover of The World is Built on Probability

Lev Tarasov (Translated by Michael Burov)

1984

198 pages

An introduction to the subject of probability motivated by examples from decision making, control theory, biology, and quantum mechanics. Was originally published in Russian and translated to English.

There are some really well written computational notebooks by Peter Norvig elucidating the probabilty concepts.

TODO: Add images for each of the Python notebooks

./img/a-concrete-introduction-to-probability.png

./img/probability-paradox-and-the-reasonable-person-principle.png

./img/estimating-probabilities-with-simulations.png

Dan Morris

114 pages

Cover of Bayes Theorem: A Visual Introduction for Beginners

A short and quick introduction to Bayes Theorem.

Cover of Seeing Theory

Daniel Kunin, Jingru Guo, Tyler Dae Devlin, Daniel Xiang

An interactive website introducing the concepts of probability and statistics.

History

An overview of the history would benefit by providing the motivation and original scenarios in which the concepts originated. They are also a good way for people looking to research into this area to understand some of the original strands and possible find a wealth of problems that are linked with the genesis of the ideas.

Ian Hacking

1990

Cover of The Taming of Chance

A standard recommendation to learn about the origins of probability ad statistics. This book, along with The Emergence of Probability by Hacking, which has a more philosophical bent, might serve as a decent bundle for exposure to the historical details on how probability took shape as a science.

Cover of The Empire of Chance

Gerd Gigerenzer, Zeno Swijtink, Theodore Porter, Lorraine Daston, John Beatty, Lorenz Krüger

October 26, 1990

360 pages

History of modern statistics and its connections with other domains of knowledge.

Theodore M. Porter

August 18, 2020

360 pages

Cover of The Rise of Statistical Thinking

History of the subject with more of an academic bent. A general reader might find Ian Hacking’s work more approachable.

I. Todhunter

1865

[[][Chance, Logic, And Intution: An Introduction to the Counter-Intuitive Logic of Chance]]

Steven Tjims

18 February 2021 256 pages

TODO: Add link and brief

Videos

https://www.youtube.com/playlist?list=PL17567A1A3F5DB5E4

Rigorous works

Paul G. Hoel, Sidney C. Port, Charles J. Stone

1972

They also have a similar book on Statistical Theory.

A rigorous introduction to probability theory. It has been likened to Rudin’s book on mathematical analysis.

Patrick Billingsley

2012

A self-contained book on probability and commonly recommended as a rigorous introduction to the subject.

Jim Pitman

Kai Lai Chung

Sidney Resnick

Krishna B. Athreya, Soumendra N. Lahiri

Andrew Gelman, John B. Carlin, Hal S. Stern, David B. Dunson, Aki Vehtari, Donald B. Rubin

First Edition: 1995, Second Edition: 2003, Third Edition: 2013

Cover of Bayesian Data Analysis

David Williams

Rick Durrett

Leonard J. Savage

Oliver Knill

Bradley Efron, Trevor Hastie

George Casella and Roger Berger

Typically used in many universities as the starting text. Apparently more rigorous and more focused on technical details than All of Statistics.

William Feller

Achim Klenke

Recommended as a reference book on probability

Geoffrey R. Grimmett, David R. Stirzaker

Considered a standard reference to the subject

Andrew Gelman, Jennifer Hill

2006

Additional Resources

Peter L. Bernstein 1998

Benoit B. Mandelbrot, Richard L Hudson

David MacKay If you want to have an Information Theory bend

Cosma Rohilla Chalizi

Kaisa Taipale

An idiosyncratic tour through ideas in probability, calculus, linear algebra, and statistics

Sampled but not included / need another pass

Edward Nelson

The book has an econometric viewpoint towards how to infer cause and effect using statistics.

Anthony Hayter

Noga Alon and Joel H. Spencer

Robert L. Waldrop

Jim Frost

Michael Franke