The performance of prognostic models depended on the choice of missing value imputation algorithm: a simulation study

doi:10.1016/j.jclinepi.2024.111539

. 2024 Dec:176:111539.

doi: 10.1016/j.jclinepi.2024.111539. Epub 2024 Sep 24.

The performance of prognostic models depended on the choice of missing value imputation algorithm: a simulation study

Manja Deforth¹, Georg Heinze², Ulrike Held³

Affiliations

¹ Department of Biostatistics at the Epidemiology, Biostatistics and Prevention Institute, University of Zurich, Zurich, Switzerland.
² Center for Medical Data Science, Institute of Clinical Biometrics, Medical University of Vienna, Vienna, Austria.
³ Department of Biostatistics at the Epidemiology, Biostatistics and Prevention Institute, University of Zurich, Zurich, Switzerland. Electronic address: ulrike.held@uzh.ch.

PMID: 39326470
DOI: 10.1016/j.jclinepi.2024.111539

Free article

The performance of prognostic models depended on the choice of missing value imputation algorithm: a simulation study

Manja Deforth et al. J Clin Epidemiol. 2024 Dec.

Free article

. 2024 Dec:176:111539.

doi: 10.1016/j.jclinepi.2024.111539. Epub 2024 Sep 24.

Authors

Manja Deforth¹, Georg Heinze², Ulrike Held³

Affiliations

¹ Department of Biostatistics at the Epidemiology, Biostatistics and Prevention Institute, University of Zurich, Zurich, Switzerland.
² Center for Medical Data Science, Institute of Clinical Biometrics, Medical University of Vienna, Vienna, Austria.
³ Department of Biostatistics at the Epidemiology, Biostatistics and Prevention Institute, University of Zurich, Zurich, Switzerland. Electronic address: ulrike.held@uzh.ch.

PMID: 39326470
DOI: 10.1016/j.jclinepi.2024.111539

Abstract

Objectives: The development of clinical prediction models is often impeded by the occurrence of missing values in the predictors. Various methods for imputing missing values before modeling have been proposed. Some of them are based on variants of multiple imputations by chained equations, while others are based on single imputation. These methods may include elements of flexible modeling or machine learning algorithms, and for some of them user-friendly software packages are available. The aim of this study was to investigate by simulation if some of these methods consistently outperform others in performance measures of clinical prediction models.

Study design and setting: We simulated development and validation cohorts by mimicking observed distributions of predictors and outcome variable of a real data set. In the development cohorts, missing predictor values were created in 36 scenarios defined by the missingness mechanism and proportion of noncomplete cases. We applied three imputation algorithms that were available in R software (R Foundation for Statistical Computing, Vienna, Austria): mice, aregImpute, and missForest. These algorithms differed in their use of linear or flexible models, or random forests, the way of sampling from the predictive posterior distribution, and the generation of a single or multiple imputed data set. For multiple imputation, we also investigated the impact of the number of imputations. Logistic regression models were fitted with the simulated development cohorts before (full data analysis) and after missing value generation (complete case analysis), and with the imputed data. Prognostic model performance was measured by the scaled Brier score, c-statistic, calibration intercept and slope, and by the mean absolute prediction error evaluated in validation cohorts without missing values. Performance of full data analysis was considered as ideal.

Results: None of the imputation methods achieved the model's predictive accuracy that would be obtained in case of no missingness. In general, complete case analysis yielded the worst performance, and deviation from ideal performance increased with increasing percentage of missingness and decreasing sample size. Across all scenarios and performance measures, aregImpute and mice, both with 100 imputations, resulted in highest predictive accuracy. Surprisingly, aregImpute outperformed full data analysis in achieving calibration slopes very close to one across all scenarios and outcome models. The increase of mice's performance with 100 compared to five imputations was only marginal. The differences between the imputation methods decreased with increasing sample sizes and decreasing proportion of noncomplete cases.

Conclusion: In our simulation study, model calibration was more affected by the choice of the imputation method than model discrimination. While differences in model performance after using imputation methods were generally small, multiple imputation methods as mice and aregImpute that can handle linear or nonlinear associations between predictors and outcome are an attractive and reliable choice in most situations.

Keywords: AregImpute; Complete case analysis; Mice; MissForest; Missing value imputation; Prediction model.

PubMed Disclaimer

Conflict of interest statement

Declaration of competing interest The authors report no conflicts of interest related to this paper.

Cited by

The role of lipid profile in the relationship between skipping breakfast and hyperuricemia: a moderated mediation model.
Deng Z, Zhou F, Tian G, Wang Q, Yan Y. Deng Z, et al. BMC Public Health. 2025 Apr 10;25(1):1347. doi: 10.1186/s12889-025-22594-7. BMC Public Health. 2025. PMID: 40211199 Free PMC article.
Imaging-pathology correlation in pancreatic cancer: Methodological considerations and future directions.
Krishnan A. Krishnan A. World J Gastrointest Oncol. 2025 Jul 15;17(7):103282. doi: 10.4251/wjgo.v17.i7.103282. World J Gastrointest Oncol. 2025. PMID: 40697244 Free PMC article.

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
- ClinicalKey
- Elsevier Science

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

The performance of prognostic models depended on the choice of missing value imputation algorithm: a simulation study

Affiliations

The performance of prognostic models depended on the choice of missing value imputation algorithm: a simulation study

Authors

Affiliations

Abstract

Conflict of interest statement

Similar articles

Cited by

MeSH terms

LinkOut - more resources

Full Text Sources

Abstract

Conflict of interest statement

Similar articles

Cited by

MeSH terms

Related information

LinkOut - more resources

Full Text Sources