Attrition in longitudinal studies. How to deal with missing data
- PMID: 11927199
- DOI: 10.1016/s0895-4356(01)00476-0
Abstract
The purpose of this paper was to illustrate the influence of missing data on the results of longitudinal statistical analyses [i.e., MANOVA for repeated measurements and Generalised Estimating Equations (GEE)] and the influence of using different imputation methods to replace missing data. Besides a complete dataset, four incomplete datasets were considered: two datasets with 10% missing data and two datasets with 25% missing data. In both situations, missingness was considered either independent of or dependent on observed data. Imputation methods were divided into cross-sectional methods (i.e., mean of series, hot deck, and cross-sectional regression) and longitudinal methods (i.e., last value carried forward, longitudinal interpolation, and longitudinal regression). In addition, the multiple imputation method was applied and discussed. The analyses were performed on a particular (observational) longitudinal dataset, with particular missing data patterns and imputation methods. The results of this illustration show that when MANOVA for repeated measurements is used, imputation methods are highly recommended (because MANOVA, as implemented in the software used, applies listwise deletion of cases with a missing value). When GEE analysis was applied, imputation methods were not necessary. When imputation methods were used, longitudinal imputation methods were often preferable to cross-sectional imputation methods, in that the point estimates and standard errors were closer to the estimates derived from the complete dataset. Furthermore, this study showed that the theoretically more valid multiple imputation method did not lead to different point estimates than the simpler (longitudinal) imputation methods. However, the standard errors estimated with multiple imputation appeared to be theoretically more adequate, because they reflect the uncertainty in estimation caused by missing values.
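As a rough illustration of three of the simpler methods named in the abstract (mean of series, last value carried forward, and longitudinal interpolation), the following minimal Python sketch fills gaps in small lists of measurements. It is not the paper's implementation: the function names, the use of `None` to mark a missing value, and the choice of linear interpolation are assumptions made for this example. The last function pools results across multiply imputed datasets using Rubin's rules, a standard approach that the abstract does not name explicitly; it is included only to show why multiple imputation standard errors can reflect the uncertainty caused by missing values.

```python
from statistics import mean, variance

# Illustrative sketch only; None marks a missing measurement (an assumption).

def mean_of_series(values):
    """Cross-sectional: replace gaps with the mean of the observed values
    across subjects at a single measurement occasion."""
    observed = [v for v in values if v is not None]
    fill = mean(observed)
    return [fill if v is None else v for v in values]

def last_value_carried_forward(series):
    """Longitudinal: repeat one subject's most recent observed value."""
    filled, last = [], None
    for v in series:
        if v is not None:
            last = v
        filled.append(last)  # stays None until a first value is observed
    return filled

def linear_interpolation(series):
    """Longitudinal: draw a straight line between a subject's neighbouring
    observed values; leading and trailing gaps are left missing."""
    filled = list(series)
    known = [i for i, v in enumerate(series) if v is not None]
    for left, right in zip(known, known[1:]):
        step = (series[right] - series[left]) / (right - left)
        for i in range(left + 1, right):
            filled[i] = series[left] + step * (i - left)
    return filled

def pool_rubin(estimates, standard_errors):
    """Pool m >= 2 per-dataset estimates with Rubin's rules.
    Total variance = within + (1 + 1/m) * between."""
    m = len(estimates)
    pooled = mean(estimates)
    within = mean([se ** 2 for se in standard_errors])
    between = variance(estimates)  # sample variance, denominator m - 1
    total = within + (1 + 1 / m) * between
    return pooled, total ** 0.5
```

For example, `last_value_carried_forward([5.0, None, None, 7.0, None])` repeats 5.0 over the first gap and 7.0 over the last one. In `pool_rubin`, the between-imputation term inflates the pooled standard error whenever the imputed datasets disagree, which single imputation methods cannot capture.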