Distributional anchor regression
- PMID: 35582000
- PMCID: PMC9106647
- DOI: 10.1007/s11222-022-10097-z
Distributional anchor regression
Abstract
Prediction models often fail if train and test data do not stem from the same distribution. Out-of-distribution (OOD) generalization to unseen, perturbed test data is a desirable but difficult-to-achieve property for prediction models and in general requires strong assumptions on the data generating process (DGP). In a causally inspired perspective on OOD generalization, the test data arise from a specific class of interventions on exogenous random variables of the DGP, called anchors. Anchor regression models, introduced by Rothenhäusler et al. (J R Stat Soc Ser B 83(2):215-246, 2021. 10.1111/rssb.12398), protect against distributional shifts in the test data by employing causal regularization. However, so far anchor regression has only been used with a squared-error loss which is inapplicable to common responses such as censored continuous or ordinal data. Here, we propose a distributional version of anchor regression which generalizes the method to potentially censored responses with at least an ordered sample space. To this end, we combine a flexible class of parametric transformation models for distributional regression with an appropriate causal regularizer under a more general notion of residuals. In an exemplary application and several simulation scenarios we demonstrate the extent to which OOD generalization is possible.
Keywords: Anchor regression; Covariate shift; Diluted causality; Distributional regression; Out-of-distribution generalization; Transformation models.
© The Author(s) 2022.
Figures













References
-
- Aalen O, Borgan O, Gjessing H. Survival and Event History Analysis: A Process Point of View. Berlin: Springer; 2008.
-
- Abadi, M. et al.: TensorFlow: Large-Scale Machine Learning on Heterogeneous Systems. https://tensorflow.org/, software available from tensorflow.org (2015)
-
- Angrist JD, Imbens GW, Rubin DB. Identification of causal effects using instrumental variables. J. Am. Stat. Assoc. 1996;91(434):444–455. doi: 10.1080/01621459.1996.10476902. - DOI
-
- Arjovsky, M., Bottou, L., Gulrajani, I., Lopez-Paz, D.: Invariant risk minimization. arXiv preprint arXiv:1907.02893 (2019)
-
- Azzalini A, Bowman AW. A look at some data on the old faithful Geyser. Appl. Stat. 1990;39(3):357. doi: 10.2307/2347385. - DOI
LinkOut - more resources
Full Text Sources