An index of datasets that can be used for learning causality.
Please cite our survey if this data index helps your research.
@article{guo2018survey,
title={A Survey of Learning Causality with Data: Problems and Methods},
author={Guo, Ruocheng and Cheng, Lu and Li, Jundong and Hahn, P. Richard and Liu, Huan},
journal={arXiv preprint arXiv:1809.09337},
year={2018}
}
Updates coming soon
Standard datasets for learning causal effects comes with each instance in the format of (x,d,y).
How is IHDP1 (setting A) simulated
Job Training (Lalonde 1986 in the R package qte)
Datasets with non-i.i.d. samples (with interference, spillover effect or auxiliary network information)
Standard datasets for learning causal effects, each instance has the format of (i,x,d,y).
Population Threshold RDD Datasets
Database with cause-effect pairs (Tbingen Cause-Effect Pairs)
Lung Cancer Simple Set (LUCAS)