Multiple imputation of baseline data in the cardiovascular health study
- PMID: 12505893
- DOI: 10.1093/aje/kwf156
Multiple imputation of baseline data in the cardiovascular health study
Abstract
Most epidemiologic studies will encounter missing covariate data. Software packages typically used for analyzing data delete any cases with a missing covariate to perform a complete case analysis. The deletion of cases complicates variable selection when different variables are missing on different cases, reduces power, and creates the potential for bias in the resulting estimates. Recently, software has become available for producing multiple imputations of missing data that account for the between-imputation variability. The implementation of the software to impute missing baseline data in the setting of the Cardiovascular Health Study, a large, observational study, is described. Results of exploratory analyses using the imputed data were largely consistent with results using only complete cases, even in a situation where one third of the cases were excluded from the complete case analysis. There were few differences in the exploratory results across three imputations, and the combined results from the multiple imputations were very similar to results from a single imputation. An increase in power was evident and variable selection simplified when using the imputed data sets.
Similar articles
-
Multiple imputation for non-response when estimating HIV prevalence using survey data.BMC Public Health. 2015 Oct 16;15:1059. doi: 10.1186/s12889-015-2390-1. BMC Public Health. 2015. PMID: 26475303 Free PMC article.
-
Use of multiple imputation in the epidemiologic literature.Am J Epidemiol. 2008 Aug 15;168(4):355-7. doi: 10.1093/aje/kwn071. Epub 2008 Jun 30. Am J Epidemiol. 2008. PMID: 18591202 Free PMC article. Review.
-
Imputation of missing longitudinal data: a comparison of methods.J Clin Epidemiol. 2003 Oct;56(10):968-76. doi: 10.1016/s0895-4356(03)00170-7. J Clin Epidemiol. 2003. PMID: 14568628
-
[Multiple imputations for missing data: a simulation with epidemiological data].Cad Saude Publica. 2009 Feb;25(2):268-78. doi: 10.1590/s0102-311x2009000200005. Cad Saude Publica. 2009. PMID: 19219234 Portuguese.
-
Multiple imputation for missing data.Res Nurs Health. 2002 Feb;25(1):76-84. doi: 10.1002/nur.10015. Res Nurs Health. 2002. PMID: 11807922 Review.
Cited by
-
Multiple imputation for missing with cardiac magnetic resonance imaging data: results from the Multi-Ethnic Study of Atherosclerosis (MESA).Can J Cardiol. 2009 Jul;25(7):e232-5. doi: 10.1016/s0828-282x(09)70507-0. Can J Cardiol. 2009. PMID: 19584978 Free PMC article.
-
Genetic variants related to height and risk of atrial fibrillation: the cardiovascular health study.Am J Epidemiol. 2014 Jul 15;180(2):215-22. doi: 10.1093/aje/kwu126. Epub 2014 Jun 18. Am J Epidemiol. 2014. PMID: 24944287 Free PMC article.
-
Risk factor redistribution of the national HIV/AIDS surveillance data: an alternative approach.Public Health Rep. 2008 Sep-Oct;123(5):618-27. doi: 10.1177/003335490812300512. Public Health Rep. 2008. PMID: 18828417 Free PMC article.
-
Lipoprotein-associated phospholipase A2 (Lp-PLA2) and future risk of type 2 diabetes: results from the Cardiovascular Health Study.J Clin Endocrinol Metab. 2012 May;97(5):1695-701. doi: 10.1210/jc.2011-3026. Epub 2012 Mar 7. J Clin Endocrinol Metab. 2012. PMID: 22399516 Free PMC article.
-
The influence of sex on cardiovascular outcomes associated with diabetes among older black and white adults.J Diabetes Complications. 2014 May-Jun;28(3):316-22. doi: 10.1016/j.jdiacomp.2013.12.004. Epub 2013 Dec 26. J Diabetes Complications. 2014. PMID: 24461547 Free PMC article.
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources