Assessing the goodness of fit of logistic regression models in large samples: A modification of the Hosmer-Lemeshow test
- PMID: 32134502
- DOI: 10.1111/biom.13249
Assessing the goodness of fit of logistic regression models in large samples: A modification of the Hosmer-Lemeshow test
Abstract
Evaluating the goodness of fit of logistic regression models is crucial to ensure the accuracy of the estimated probabilities. Unfortunately, such evaluation is problematic in large samples. Because the power of traditional goodness of fit tests increases with the sample size, practically irrelevant discrepancies between estimated and true probabilities are increasingly likely to cause the rejection of the hypothesis of perfect fit in larger and larger samples. This phenomenon has been widely documented for popular goodness of fit tests, such as the Hosmer-Lemeshow test. To address this limitation, we propose a modification of the Hosmer-Lemeshow approach. By standardizing the noncentrality parameter that characterizes the alternative distribution of the Hosmer-Lemeshow statistic, we introduce a parameter that measures the goodness of fit of a model but does not depend on the sample size. We provide the methodology to estimate this parameter and construct confidence intervals for it. Finally, we propose a formal statistical test to rigorously assess whether the fit of a model, albeit not perfect, is acceptable for practical purposes. The proposed method is compared in a simulation study with a competing modification of the Hosmer-Lemeshow test, based on repeated subsampling. We provide a step-by-step illustration of our method using a model for postneonatal mortality developed in a large cohort of more than 300 000 observations.
Keywords: Hosmer-Lemeshow test; calibration; goodness of fit; large samples; logistic regression; noncentrality parameter.
© 2020 The International Biometric Society.
Comment in
-
Discussion of "Assessing the goodness-of-fit of logistic regression models in large samples: A modification of the Hosmer-Lemeshow test," by Giovanni Nattino, Michael L. Pennell, and Stanley Lemeshow.Biometrics. 2020 Jun;76(2):569-571. doi: 10.1111/biom.13255. Epub 2020 Apr 6. Biometrics. 2020. PMID: 32251523 No abstract available.
-
Discussion on "Assessing the goodness of fit of logistic regression models in large samples: A modification of the Hosmer-Lemeshow test" by Giovanni Nattino, Michael L. Pennell, and Stanley Lemeshow.Biometrics. 2020 Jun;76(2):572-574. doi: 10.1111/biom.13248. Epub 2020 Apr 6. Biometrics. 2020. PMID: 32251529 Free PMC article. No abstract available.
-
Discussion on "Assessing the goodness of fit of logistic regression models in large samples: A modification of the Hosmer-Lemeshow test" by Giovanni Nattino, Michael L. Pennell, and Stanley Lemeshow.Biometrics. 2020 Jun;76(2):561-563. doi: 10.1111/biom.13257. Epub 2020 Apr 6. Biometrics. 2020. PMID: 32251532 No abstract available.
-
Rejoinder to "Assessing the goodness of fit of logistic regression models in large samples: A modification of the Hosmer-Lemeshow test".Biometrics. 2020 Jun;76(2):575-577. doi: 10.1111/biom.13250. Epub 2020 Apr 6. Biometrics. 2020. PMID: 32251533 No abstract available.
-
Discussion on "Assessing the goodness of fit of logistic regression models in large samples: A modification of the Hosmer-Lemeshow test" by Giovanni Nattino, Michael L. Pennell, and Stanley Lemeshow.Biometrics. 2020 Jun;76(2):564-568. doi: 10.1111/biom.13251. Epub 2020 Apr 6. Biometrics. 2020. PMID: 32251538 No abstract available.
Similar articles
-
Standardizing the power of the Hosmer-Lemeshow goodness of fit test in large data sets.Stat Med. 2013 Jan 15;32(1):67-80. doi: 10.1002/sim.5525. Epub 2012 Jul 26. Stat Med. 2013. PMID: 22833304
-
Assessing the calibration of mortality benchmarks in critical care: The Hosmer-Lemeshow test revisited.Crit Care Med. 2007 Sep;35(9):2052-6. doi: 10.1097/01.CCM.0000275267.64078.B0. Crit Care Med. 2007. PMID: 17568333
-
Two goodness-of-fit tests for logistic regression models with continuous covariates.Stat Med. 2002 Jan 15;21(1):79-93. doi: 10.1002/sim.943. Stat Med. 2002. PMID: 11782052
-
A comparison of goodness-of-fit tests for the logistic regression model.Stat Med. 1997 May 15;16(9):965-80. doi: 10.1002/(sici)1097-0258(19970515)16:9<965::aid-sim509>3.0.co;2-o. Stat Med. 1997. PMID: 9160492 Review.
-
A discussion of calibration techniques for evaluating binary and categorical predictive models.Prev Vet Med. 2018 Jan 1;149:107-114. doi: 10.1016/j.prevetmed.2017.11.018. Epub 2017 Nov 24. Prev Vet Med. 2018. PMID: 29290291 Review.
Cited by
-
An Exploration of Smoking Patterns Among People with Serious Mental Illness Attending an Outpatient Clinic in Qatar.Neuropsychiatr Dis Treat. 2022 Dec 7;18:2847-2854. doi: 10.2147/NDT.S385970. eCollection 2022. Neuropsychiatr Dis Treat. 2022. PMID: 36518535 Free PMC article.
-
Factors associated with community volunteering among adults over the age of 50 in Malaysia.PLoS One. 2024 May 16;19(5):e0302220. doi: 10.1371/journal.pone.0302220. eCollection 2024. PLoS One. 2024. PMID: 38753828 Free PMC article.
-
White Blood Cell Count Predicts Mortality in Patients with Spontaneous Intracerebral Hemorrhage.Neurocrit Care. 2023 Oct;39(2):445-454. doi: 10.1007/s12028-023-01716-2. Epub 2023 Apr 10. Neurocrit Care. 2023. PMID: 37037993
-
Factors associated with pulmonary complications after hepatectomy and establishment of nomogram: A real-world retrospective study.Indian J Anaesth. 2025 Feb;69(2):225-235. doi: 10.4103/ija.ija_885_24. Epub 2025 Jan 29. Indian J Anaesth. 2025. PMID: 40160904 Free PMC article.
-
Development and Validation of Risk Prediction Models for Colorectal Cancer in Patients with Symptoms.J Pers Med. 2023 Jun 29;13(7):1065. doi: 10.3390/jpm13071065. J Pers Med. 2023. PMID: 37511678 Free PMC article.
References
REFERENCES
-
- Archer, K.J., Lemeshow, S. and Hosmer, D.W. (2007) Goodness-of-fit tests for logistic regression models when data are collected using a complex sampling design. Computational Statistics & Data Analysis, 51(9), 4450-4464.
-
- Austin, P.C. and Steyerberg, E.W. (2014) Graphical assessment of internal and external calibration of logistic regression models by using loess smoothers. Statistics in Medicine, 33(3), 517-535.
-
- Browne, M.W. and Cudeck, R. (1992) Alternative ways of assessing model fit. Sociological Methods & Research, 21(2), 230-258.
-
- Casella, G. and Berger, R.L. (2002) Statistical Inference. Pacific Grove, CA: Thomson Learning.
-
- Dahiya, R.C. and Gurland, J. (1973) How many classes in the Pearson chi-square test? Journal of the American Statistical Association, 68(343), 707-712.
MeSH terms
LinkOut - more resources
Full Text Sources