Missing Data in Surgical Data Sets: A Review of Pertinent Issues and Solutions
- PMID: 30463724
- DOI: 10.1016/j.jss.2018.06.034
Missing Data in Surgical Data Sets: A Review of Pertinent Issues and Solutions
Abstract
Incomplete data is a common problem in research studies. Methods to address missing observations in a data set have been extensively researched and described. Disseminating these methods to the greater research community is an ongoing effort. In this article, we describe some of the basic principles of missing data and identify practical, commonly used methods of adjustment relevant to surgical data sets. Through an example data set, we compare models generated through complete case analysis, single imputation (SI), and multiple imputation (MI). We also provide information on the steps to conduct MI using Stata IC. In our comparisons, we found that differences in odds ratios were greatest between the results from complete case analysis compared to the SI and MI models indicating that in this case the reduction in statistical power has a non-negligible effect on the parameter estimates. Odds ratio estimates from the SI and MI methods were largely similar. In some instances, when compared to the MI method, the SI method tended to overestimate effect sizes. While in this example the differences in odds ratios do not vary greatly between the SI and MI methods, there are clear indications supporting the use of MI over SI. By describing the issues surrounding missing data and the available options for adjustment, we hope to encourage the use of robust imputation methods for missing observations.
Keywords: Complete case analysis; Missing data; Multiple imputation; Single imputation; Statistical methodology.
Copyright © 2018 Elsevier Inc. All rights reserved.
Similar articles
-
The use of multiple imputation for the analysis of missing data.Psychol Methods. 2001 Dec;6(4):317-29. Psychol Methods. 2001. PMID: 11778675 Review.
-
Approach to addressing missing data for electronic medical records and pharmacy claims data research.Pharmacotherapy. 2015 Apr;35(4):380-7. doi: 10.1002/phar.1569. Pharmacotherapy. 2015. PMID: 25884526
-
Multiple imputation strategies for zero-inflated cost data in economic evaluations: which method works best?Eur J Health Econ. 2016 Nov;17(8):939-950. doi: 10.1007/s10198-015-0734-5. Epub 2015 Oct 23. Eur J Health Econ. 2016. PMID: 26497027 Free PMC article.
-
Imputation strategies in the trauma registration.J Trauma Acute Care Surg. 2017 Nov;83(5):828-836. doi: 10.1097/TA.0000000000001664. J Trauma Acute Care Surg. 2017. PMID: 28787379
-
Missing Data in Clinical Research: A Tutorial on Multiple Imputation.Can J Cardiol. 2021 Sep;37(9):1322-1331. doi: 10.1016/j.cjca.2020.11.010. Epub 2020 Dec 1. Can J Cardiol. 2021. PMID: 33276049 Free PMC article. Review.
Cited by
-
An Approach towards Increasing Prediction Accuracy for the Recovery of Missing IoT Data Based on the GRNN-SGTM Ensemble.Sensors (Basel). 2020 May 4;20(9):2625. doi: 10.3390/s20092625. Sensors (Basel). 2020. PMID: 32375400 Free PMC article.
-
Factors associated with large-for-gestational-age infants born after frozen embryo transfer cycles.F S Rep. 2022 Sep 9;3(4):332-341. doi: 10.1016/j.xfre.2022.09.002. eCollection 2022 Dec. F S Rep. 2022. PMID: 36568928 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Medical