[Analysis of longitudinal Gaussian data with missing data on the response variable]
- PMID: 10673586
[Analysis of longitudinal Gaussian data with missing data on the response variable]
Abstract
Background: Using an application and a simulation study we show the bias induced by missing data in the outcome in longitudinal studies and discuss suitable statistical methods according to the type of missing responses when the variable under study is gaussian.
Method: The model used for the analysis of gaussian longitudinal data is the mixed effects linear model. When the probability of response does not depend on the missing values of the outcome and on the parameters of the linear model, missing data are ignorable, and parameters of the mixed effects linear model may be estimated by the maximum likelihood method with classical softwares. When the missing data are non ignorable, several methods have been proposed. We describe the method proposed by Diggle and Kenward (1994) (DK method) for which a software is available. This model consists in the combination of a linear mixed effects model for the outcome variable and a logistic model for the probability of response which depends on the outcome variable.
Results: A simulation study shows the efficacy of this method and its limits when the data are not normal. In this case, estimators obtained by the DK approach may be more biased than estimators obtained under the hypothesis of ignorable missing data even if the data are non ignorable. Data of the Paquid cohort about the evolution of the scores to a neuropsychological test among elderly subjects show the bias of a naive analysis using all available data. Although missing responses are not ignorable in this study, estimates of the linear mixed effects model are not very different using the DK approach and the hypothesis of ignorable missing data.
Conclusion: Statistical methods for longitudinal data including non ignorable missing responses are sensitive to hypotheses difficult to verify. Thus, it will be better in practical applications to perform an analysis under the hypothesis of ignorable missing responses and compare the results obtained with several approaches for non ignorable missing data. However, such a strategy requires development of new softwares.
Similar articles
-
A copula model for repeated measurements with non-ignorable non-monotone missing outcome.Stat Med. 2006 Jul 30;25(14):2427-40. doi: 10.1002/sim.2355. Stat Med. 2006. PMID: 16143999
-
Analysis of non-ignorable missing and left-censored longitudinal data using a weighted random effects tobit model.Stat Med. 2011 Nov 30;30(27):3167-80. doi: 10.1002/sim.4344. Epub 2011 Sep 5. Stat Med. 2011. PMID: 21898524
-
Handling missing responses in generalized linear mixed model without specifying missing mechanism.J Biopharm Stat. 2009 Nov;19(6):1001-17. doi: 10.1080/10543400903242761. J Biopharm Stat. 2009. PMID: 20183461
-
A Bayesian natural cubic B-spline varying coefficient method for non-ignorable dropout.BMC Med Res Methodol. 2020 Oct 7;20(1):250. doi: 10.1186/s12874-020-01135-3. BMC Med Res Methodol. 2020. PMID: 33028226 Free PMC article. Review.
-
Missing repeated measures data in clinical trials.Neurooncol Pract. 2021 Jul 16;9(1):35-42. doi: 10.1093/nop/npab043. eCollection 2022 Feb. Neurooncol Pract. 2021. PMID: 35096402 Free PMC article. Review.
Cited by
-
Sensitivity of four psychometric tests to measure cognitive changes in brain aging-population-based studies.Am J Epidemiol. 2007 Feb 1;165(3):344-50. doi: 10.1093/aje/kwk017. Epub 2006 Nov 14. Am J Epidemiol. 2007. PMID: 17105962 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources