Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Dec;78(4):1626-1638.
doi: 10.1111/biom.13546. Epub 2021 Sep 14.

Sufficient dimension reduction for populations with structured heterogeneity

Affiliations

Sufficient dimension reduction for populations with structured heterogeneity

Jared D Huling et al. Biometrics. 2022 Dec.

Abstract

A key challenge in building effective regression models for large and diverse populations is accounting for patient heterogeneity. An example of such heterogeneity is in health system risk modeling efforts where different combinations of comorbidities fundamentally alter the relationship between covariates and health outcomes. Accounting for heterogeneity arising combinations of factors can yield more accurate and interpretable regression models. Yet, in the presence of high-dimensional covariates, accounting for this type of heterogeneity can exacerbate estimation difficulties even with large sample sizes. To handle these issues, we propose a flexible and interpretable risk modeling approach based on semiparametric sufficient dimension reduction. The approach accounts for patient heterogeneity, borrows strength in estimation across related subpopulations to improve both estimation efficiency and interpretability, and can serve as a useful exploratory tool or as a powerful predictive model. In simulated examples, we show that our approach often improves estimation performance in the presence of heterogeneity and is quite robust to deviations from its key underlying assumptions. We demonstrate our approach in an analysis of hospital admission risk for a large health system and demonstrate its predictive power when tested on further follow-up data.

Keywords: central mean subspace; data heterogeneity; health services; risk prediction; semiparametric methods.

PubMed Disclaimer

References

REFERENCES

    1. Boyd, S. & Vandenberghe, L. (2004) Convex optimization. Cambridge: Cambridge University Press.
    1. Chiaromonte, F., Cook, R.& Li, B. (2002) Sufficient dimensions reduction in regressions with categorical predictors. Annals of Statistics, 30, 475-497.
    1. Cook, R.D. (1998) Regression graphics: ideas for studying regressions through graphics. Hoboken, NJ: John Wiley & Sons.
    1. Cook, R.D. (2007) Fisher lecture: dimension reduction in regression. Statistical Science, 22, 1-26.
    1. Cook, R.D.& Li, B. (2002) Dimension reduction for conditional mean in regression. Annals of Statistics, 30, 455-474.

LinkOut - more resources