Improving prediction of linear regression models by integrating external information from heterogeneous populations: James-Stein estimators
- PMID: 39101548
- PMCID: PMC11299067
- DOI: 10.1093/biomtc/ujae072
Improving prediction of linear regression models by integrating external information from heterogeneous populations: James-Stein estimators
Abstract
We consider the setting where (1) an internal study builds a linear regression model for prediction based on individual-level data, (2) some external studies have fitted similar linear regression models that use only subsets of the covariates and provide coefficient estimates for the reduced models without individual-level data, and (3) there is heterogeneity across these study populations. The goal is to integrate the external model summary information into fitting the internal model to improve prediction accuracy. We adapt the James-Stein shrinkage method to propose estimators that are no worse and are oftentimes better in the prediction mean squared error after information integration, regardless of the degree of study population heterogeneity. We conduct comprehensive simulation studies to investigate the numerical performance of the proposed estimators. We also apply the method to enhance a prediction model for patella bone lead level in terms of blood lead level and other covariates by integrating summary information from published literature.
Keywords: James–Stein shrinkage; data integration; external summary information; meta-analysis; population heterogeneity; prediction mean squared error.
© The Author(s) 2024. Published by Oxford University Press on behalf of The International Biometric Society.
Conflict of interest statement
None declared.
Figures
References
-
- Baranchik A. J. (1970). A family of minimax estimators of the mean of a multivariate normal distribution. Annals of Mathematical Statistics, 41, 642–645.
-
- Boot T. (2020). Confidence regions for averaging estimators. https://econ.wisc.edu/wp-content/uploads/sites/89/2020/11/Boot-2020-Conf.... [Accessed June 2024].
-
- Burger D. E., Milder F. L., Morsillo P. R., Adams B. B., Hu H. (1990). Automated bone lead analysis by k-X-ray fluorescence for the clinical environment. Basic Life Sciences, 55, 287–92. - PubMed
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
