arXiv:2302.00370 Abstract | arXiv Analytics

arXiv:2302.00370 [stat.ML]Abstract References Reviews Resources

How to select predictive models for causal inference?

Published 2023-02-01Version 1

Predictive models -- as with machine learning -- can underpin causal inference, to estimate the effects of an intervention at the population or individual level. This opens the door to a plethora of models, useful to match the increasing complexity of health data, but also the Pandora box of model selection: which of these models yield the most valid causal estimates? Classic machine-learning cross-validation procedures are not directly applicable. Indeed, an appropriate selection procedure for causal inference should equally weight both outcome errors for each individual, treated or not treated, whereas one outcome may be seldom observed for a sub-population. We study how more elaborate risks benefit causal model selection. We show theoretically that simple risks are brittle to weak overlap between treated and non-treated individuals as well as to heterogeneous errors between populations. Rather a more elaborate metric, the R-risk appears as a proxy of the oracle error on causal estimates, observable at the cost of an overlap re-weighting. As the R-risk is defined not only from model predictions but also by using the conditional mean outcome and the treatment probability, using it for model selection requires adapting cross validation. Extensive experiments show that the resulting procedure gives the best causal model selection.

Comments: 31 pages

Categories: stat.ML, cs.LG

Keywords: causal inference, select predictive models, risks benefit causal model selection, elaborate risks benefit causal model, causal estimates

Related articles: Most relevant | Search more

arXiv:2406.00853 [stat.ML] (Published 2024-06-02)

A Tutorial on Doubly Robust Learning for Causal Inference

Hlynur Davíð Hlynsson

arXiv:2405.09493 [stat.ML] (Published 2024-05-15)

Constrained Learning for Causal Inference and Semiparametric Statistics

Tiffany Tianhui Cai, Yuri Fonseca, Kaiwen Hou, Hongseok Namkoong

arXiv:1606.03203 [stat.ML] (Published 2016-06-10)

Causal Bandits: Learning Good Interventions via Causal Inference