Machine-Learning

Machine Learning and Causal Inference taught by Brigham Frandsen

About

Machine learning's wheelhouse is out-of-sample prediction, but these powerful methods can also be deployed in service of causal inference. This two-session workshop introduces the basics of machine learning prediction methods, including lasso and random forests, and shows how they feature in causal inference methods such as double machine learning (DML) and post-double selection lasso (PDS lasso). The course covers the conceptual and theoretical basis for the methods and also gets into the nuts and bolts of implementation in Python and Stata using real-world data.

Schedule

Day 1

  1. What’s your question? (prediction vs. causality)

  2. Standard tools of causal inference
  • gold standard: RCT
  • multiple regression

  3. ML prediction tools
  • prediction objective
  • bias-variance tradeoff
  • lasso
  • random forest
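
As a rough illustration of the Day 1 prediction tools, the sketch below compares OLS, lasso, and a random forest on held-out data. It is not taken from the course labs: the simulated data, the sparse linear truth, and all variable names are assumptions made for this example, and scikit-learn is assumed to be installed.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.linear_model import LassoCV, LinearRegression
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import train_test_split

# Simulated data (an assumption for illustration): 50 covariates,
# but only the first three actually affect the outcome.
rng = np.random.default_rng(0)
n, p = 400, 50
X = rng.normal(size=(n, p))
y = 2.0 * X[:, 0] - 1.5 * X[:, 1] + 1.0 * X[:, 2] + rng.normal(size=n)

# Hold out half of the sample so that performance is judged out of sample.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.5, random_state=0
)

models = {
    "ols": LinearRegression(),
    "lasso": LassoCV(cv=5, random_state=0),  # penalty chosen by cross-validation
    "random_forest": RandomForestRegressor(n_estimators=200, random_state=0),
}

# Fit each model on the training half and score it on the held-out half.
test_mse = {}
for name, model in models.items():
    model.fit(X_train, y_train)
    test_mse[name] = mean_squared_error(y_test, model.predict(X_test))
    print(f"{name}: test MSE = {test_mse[name]:.3f}")
```

With many irrelevant covariates, OLS tends to overfit the training half, while lasso trades a little bias for lower variance by shrinking coefficients toward zero. Which method predicts best depends on the data-generating process; the linear, sparse design here is favorable to lasso by construction.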

Day 2

  1. Where does ML prediction fit within causal inference?
  • flexibly adjust for covariates
  • estimate heterogeneous treatment effects

  2. Post-Double Selection Lasso
  • Theory
  • Implementation

  3. Double Machine Learning
  • Theory
  • Implementation
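
The two Day 2 estimators can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the course's implementation: the data are simulated with a known treatment effect of 1.0, the helper `dml_partially_linear` is a hypothetical name introduced here, and the lasso penalty is chosen by cross-validation (`LassoCV`) as a convenient stand-in for the theory-driven penalty used in the post-double selection literature.

```python
import numpy as np
from sklearn.linear_model import LassoCV, LinearRegression
from sklearn.model_selection import cross_val_predict

# Simulated data (an assumption for illustration): the treatment d depends
# on covariates that also affect the outcome y, so d is confounded.
rng = np.random.default_rng(0)
n, p = 1000, 30
X = rng.normal(size=(n, p))
d = 0.8 * X[:, 0] + 0.5 * X[:, 1] + rng.normal(size=n)
true_effect = 1.0
y = true_effect * d + 1.0 * X[:, 0] - 1.0 * X[:, 2] + rng.normal(size=n)

# Naive benchmark: regress y on d alone, ignoring the confounders.
naive_effect = LinearRegression().fit(d.reshape(-1, 1), y).coef_[0]

# Post-double selection lasso:
# 1) lasso of y on X, 2) lasso of d on X,
# 3) OLS of y on d plus the union of the selected covariates.
selected_y = np.flatnonzero(LassoCV(cv=5).fit(X, y).coef_)
selected_d = np.flatnonzero(LassoCV(cv=5).fit(X, d).coef_)
selected = np.union1d(selected_y, selected_d)
Z = np.column_stack([d, X[:, selected]])
pds_effect = LinearRegression().fit(Z, y).coef_[0]


def dml_partially_linear(y, d, X, learner, n_folds=5):
    """Hypothetical helper: cross-fitted partialling-out estimate."""
    # Out-of-fold predictions of y and d from X (cross-fitting).
    y_hat = cross_val_predict(learner, X, y, cv=n_folds)
    d_hat = cross_val_predict(learner, X, d, cv=n_folds)
    y_res, d_res = y - y_hat, d - d_hat
    # Regress the outcome residual on the treatment residual (no intercept).
    return (d_res @ y_res) / (d_res @ d_res)


dml_effect = dml_partially_linear(y, d, X, LassoCV(cv=5))

print(f"naive OLS: {naive_effect:.3f}")
print(f"PDS lasso: {pds_effect:.3f}")
print(f"DML:       {dml_effect:.3f}")
```

Any scikit-learn regressor, such as `RandomForestRegressor`, can be passed as `learner` in place of lasso. Standard errors are omitted here to keep the sketch short.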

Readings

The following introductory readings on machine learning and causal inference are listed in a suggested reading order:

Kleinberg, Ludwig, Mullainathan, and Obermeyer (2015)

Varian (2014)

Mullainathan and Spiess (2017)

Athey and Imbens (2019)

Belloni, Chernozhukov, and Hansen (2014)

Slides

Day 1

Day 2

Coding Labs

  1. RCT to Regression Open in Colab

  2. Prediction Open in Colab

  3. Causal via Prediction Open in Colab