Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012 Sep 10;30(26):3297-303.
doi: 10.1200/JCO.2011.38.7589. Epub 2012 May 29.

Missing data in clinical studies: issues and methods

Affiliations

Missing data in clinical studies: issues and methods

Joseph G Ibrahim et al. J Clin Oncol. .

Abstract

Missing data are a prevailing problem in any type of data analyses. A participant variable is considered missing if the value of the variable (outcome or covariate) for the participant is not observed. In this article, various issues in analyzing studies with missing data are discussed. Particularly, we focus on missing response and/or covariate data for studies with discrete, continuous, or time-to-event end points in which generalized linear models, models for longitudinal data such as generalized linear mixed effects models, or Cox regression models are used. We discuss various classifications of missing data that may arise in a study and demonstrate in several situations that the commonly used method of throwing out all participants with any missing data may lead to incorrect results and conclusions. The methods described are applied to data from an Eastern Cooperative Oncology Group phase II clinical trial of liver cancer and a phase III clinical trial of advanced non-small-cell lung cancer. Although the main area of application discussed here is cancer, the issues and methods we discuss apply to any type of study.

PubMed Disclaimer

Conflict of interest statement

Authors' disclosures of potential conflicts of interest and author contributions are found at the end of this article.

References

    1. Lipsitz SR, Ibrahim JG. Estimating equations with incomplete categorical covariates in the Cox model. Biometrics. 1998;54:1002–1013. - PubMed
    1. Ibrahim JG, Chen MH, Lipsitz SR. Missing responses in generalised linear mixed models when the missing data mechanism is nonignorable. Biometrika. 2001;88:551–564.
    1. Little RJA, Rubin DB. Statistical Analysis With Missing Data. ed 2. Hoboken, NJ: John Wiley and Sons; 2002.
    1. Verbeke G, Molenberghs G. Linear Mixed Models for Longitudinal Data. New York, NY: Springer; 2000.
    1. Molenberghs G, Verbeke G. Models for Discrete Longitudinal Data. New York, NY: Springer; 2005.