Sufficient dimension reduction for populations with structured heterogeneity
- PMID: 34520573
- DOI: 10.1111/biom.13546
Sufficient dimension reduction for populations with structured heterogeneity
Abstract
A key challenge in building effective regression models for large and diverse populations is accounting for patient heterogeneity. An example of such heterogeneity is in health system risk modeling efforts where different combinations of comorbidities fundamentally alter the relationship between covariates and health outcomes. Accounting for heterogeneity arising combinations of factors can yield more accurate and interpretable regression models. Yet, in the presence of high-dimensional covariates, accounting for this type of heterogeneity can exacerbate estimation difficulties even with large sample sizes. To handle these issues, we propose a flexible and interpretable risk modeling approach based on semiparametric sufficient dimension reduction. The approach accounts for patient heterogeneity, borrows strength in estimation across related subpopulations to improve both estimation efficiency and interpretability, and can serve as a useful exploratory tool or as a powerful predictive model. In simulated examples, we show that our approach often improves estimation performance in the presence of heterogeneity and is quite robust to deviations from its key underlying assumptions. We demonstrate our approach in an analysis of hospital admission risk for a large health system and demonstrate its predictive power when tested on further follow-up data.
Keywords: central mean subspace; data heterogeneity; health services; risk prediction; semiparametric methods.
© 2021 The International Biometric Society.
References
REFERENCES
-
- Boyd, S. & Vandenberghe, L. (2004) Convex optimization. Cambridge: Cambridge University Press.
-
- Chiaromonte, F., Cook, R.& Li, B. (2002) Sufficient dimensions reduction in regressions with categorical predictors. Annals of Statistics, 30, 475-497.
-
- Cook, R.D. (1998) Regression graphics: ideas for studying regressions through graphics. Hoboken, NJ: John Wiley & Sons.
-
- Cook, R.D. (2007) Fisher lecture: dimension reduction in regression. Statistical Science, 22, 1-26.
-
- Cook, R.D.& Li, B. (2002) Dimension reduction for conditional mean in regression. Annals of Statistics, 30, 455-474.
MeSH terms
LinkOut - more resources
Full Text Sources