Assessing the goodness of fit of logistic regression models in large samples: A modification of the Hosmer-Lemeshow test

Giovanni Nattino^{1

2}, Michael L Pennell², Stanley Lemeshow²

Affiliations

¹ The Ohio Colleges of Medicine Government Resource Center, Ohio State University, Columbus, Ohio.
² Division of Biostatistics, College of Public Health, Ohio State University, Columbus, Ohio.

PMID: 32134502
DOI: 10.1111/biom.13249

Assessing the goodness of fit of logistic regression models in large samples: A modification of the Hosmer-Lemeshow test

Giovanni Nattino et al. Biometrics. 2020 Jun.

. 2020 Jun;76(2):549-560.

doi: 10.1111/biom.13249. Epub 2020 Apr 6.

Authors

Giovanni Nattino^{1

2}, Michael L Pennell², Stanley Lemeshow²

Affiliations

¹ The Ohio Colleges of Medicine Government Resource Center, Ohio State University, Columbus, Ohio.
² Division of Biostatistics, College of Public Health, Ohio State University, Columbus, Ohio.

PMID: 32134502
DOI: 10.1111/biom.13249

Abstract

Evaluating the goodness of fit of logistic regression models is crucial to ensure the accuracy of the estimated probabilities. Unfortunately, such evaluation is problematic in large samples. Because the power of traditional goodness of fit tests increases with the sample size, practically irrelevant discrepancies between estimated and true probabilities are increasingly likely to cause the rejection of the hypothesis of perfect fit in larger and larger samples. This phenomenon has been widely documented for popular goodness of fit tests, such as the Hosmer-Lemeshow test. To address this limitation, we propose a modification of the Hosmer-Lemeshow approach. By standardizing the noncentrality parameter that characterizes the alternative distribution of the Hosmer-Lemeshow statistic, we introduce a parameter that measures the goodness of fit of a model but does not depend on the sample size. We provide the methodology to estimate this parameter and construct confidence intervals for it. Finally, we propose a formal statistical test to rigorously assess whether the fit of a model, albeit not perfect, is acceptable for practical purposes. The proposed method is compared in a simulation study with a competing modification of the Hosmer-Lemeshow test, based on repeated subsampling. We provide a step-by-step illustration of our method using a model for postneonatal mortality developed in a large cohort of more than 300 000 observations.

Keywords: Hosmer-Lemeshow test; calibration; goodness of fit; large samples; logistic regression; noncentrality parameter.

PubMed Disclaimer

Comment in

Discussion of "Assessing the goodness-of-fit of logistic regression models in large samples: A modification of the Hosmer-Lemeshow test," by Giovanni Nattino, Michael L. Pennell, and Stanley Lemeshow.
Chen LC, Wang JY. Chen LC, et al. Biometrics. 2020 Jun;76(2):569-571. doi: 10.1111/biom.13255. Epub 2020 Apr 6. Biometrics. 2020. PMID: 32251523 No abstract available.
Discussion on "Assessing the goodness of fit of logistic regression models in large samples: A modification of the Hosmer-Lemeshow test" by Giovanni Nattino, Michael L. Pennell, and Stanley Lemeshow.
Liu D, Shepherd BE. Liu D, et al. Biometrics. 2020 Jun;76(2):572-574. doi: 10.1111/biom.13248. Epub 2020 Apr 6. Biometrics. 2020. PMID: 32251529 Free PMC article. No abstract available.
Discussion on "Assessing the goodness of fit of logistic regression models in large samples: A modification of the Hosmer-Lemeshow test" by Giovanni Nattino, Michael L. Pennell, and Stanley Lemeshow.
Shmueli G. Shmueli G. Biometrics. 2020 Jun;76(2):561-563. doi: 10.1111/biom.13257. Epub 2020 Apr 6. Biometrics. 2020. PMID: 32251532 No abstract available.
Rejoinder to "Assessing the goodness of fit of logistic regression models in large samples: A modification of the Hosmer-Lemeshow test".
Nattino G, Pennell ML, Lemeshow S. Nattino G, et al. Biometrics. 2020 Jun;76(2):575-577. doi: 10.1111/biom.13250. Epub 2020 Apr 6. Biometrics. 2020. PMID: 32251533 No abstract available.
Discussion on "Assessing the goodness of fit of logistic regression models in large samples: A modification of the Hosmer-Lemeshow test" by Giovanni Nattino, Michael L. Pennell, and Stanley Lemeshow.
Liu I, Fernández D. Liu I, et al. Biometrics. 2020 Jun;76(2):564-568. doi: 10.1111/biom.13251. Epub 2020 Apr 6. Biometrics. 2020. PMID: 32251538 No abstract available.

References

REFERENCES

1. Archer, K.J., Lemeshow, S. and Hosmer, D.W. (2007) Goodness-of-fit tests for logistic regression models when data are collected using a complex sampling design. Computational Statistics & Data Analysis, 51(9), 4450-4464.
1. Austin, P.C. and Steyerberg, E.W. (2014) Graphical assessment of internal and external calibration of logistic regression models by using loess smoothers. Statistics in Medicine, 33(3), 517-535.
1. Browne, M.W. and Cudeck, R. (1992) Alternative ways of assessing model fit. Sociological Methods & Research, 21(2), 230-258.
1. Casella, G. and Berger, R.L. (2002) Statistical Inference. Pacific Grove, CA: Thomson Learning.
1. Dahiya, R.C. and Gurland, J. (1973) How many classes in the Pearson chi-square test? Journal of the American Statistical Association, 68(343), 707-712.

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
- Silverchair Information Systems
- Wiley

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Assessing the goodness of fit of logistic regression models in large samples: A modification of the Hosmer-Lemeshow test

Affiliations

Assessing the goodness of fit of logistic regression models in large samples: A modification of the Hosmer-Lemeshow test

Authors

Affiliations

Abstract

Comment in

References

REFERENCES

MeSH terms

LinkOut - more resources

Full Text Sources