arXiv Analytics

Sign in

arXiv:2505.00310 [stat.ML]AbstractReferencesReviewsResources

Statistical Learning for Heterogeneous Treatment Effects: Pretraining, Prognosis, and Prediction

Maximilian Schuessler, Erik Sverdrup, Robert Tibshirani

Published 2025-05-01, updated 2025-06-18Version 2

Robust estimation of heterogeneous treatment effects is a fundamental challenge for optimal decision-making in domains ranging from personalized medicine to educational policy. In recent years, predictive machine learning has emerged as a valuable toolbox for causal estimation, enabling more flexible effect estimation. However, accurately estimating conditional average treatment effects (CATE) remains a major challenge, particularly in the presence of many covariates. In this article, we propose pretraining strategies that leverage a phenomenon in real-world applications: factors that are prognostic of the outcome are frequently also predictive of treatment effect heterogeneity. In medicine, for example, components of the same biological signaling pathways frequently influence both baseline risk and treatment response. Specifically, we demonstrate our approach within the R-learner framework, which estimates the CATE by solving individual prediction problems based on a residualized loss. We use this structure to incorporate side information and develop models that can exploit synergies between risk prediction and causal effect estimation. In settings where these synergies are present, this cross-task learning enables more accurate signal detection, yields lower estimation error, reduced false discovery rates, and higher power for detecting heterogeneity.

Related articles: Most relevant | Search more
arXiv:2006.04709 [stat.ML] (Published 2020-06-08)
Wasserstein Random Forests and Applications in Heterogeneous Treatment Effects
arXiv:2301.10913 [stat.ML] (Published 2023-01-26)
Proximal Causal Learning of Heterogeneous Treatment Effects
arXiv:2302.01367 [stat.ML] (Published 2023-02-02)
Augmented Learning of Heterogeneous Treatment Effects via Gradient Boosting Trees