Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Jun;28(6):1676-1688.
doi: 10.1177/0962280218772592. Epub 2018 May 2.

Cox regression analysis with missing covariates via nonparametric multiple imputation

Affiliations

Cox regression analysis with missing covariates via nonparametric multiple imputation

Chiu-Hsieh Hsu et al. Stat Methods Med Res. 2019 Jun.

Abstract

We consider the situation of estimating Cox regression in which some covariates are subject to missing, and there exists additional information (including observed event time, censoring indicator and fully observed covariates) which may be predictive of the missing covariates. We propose to use two working regression models: one for predicting the missing covariates and the other for predicting the missing probabilities. For each missing covariate observation, these two working models are used to define a nearest neighbor imputing set. This set is then used to non-parametrically impute covariate values for the missing observation. Upon the completion of imputation, Cox regression is performed on the multiply imputed datasets to estimate the regression coefficients. In a simulation study, we compare the nonparametric multiple imputation approach with the augmented inverse probability weighted (AIPW) method, which directly incorporates the two working models into estimation of Cox regression, and the predictive mean matching imputation (PMM) method. We show that all approaches can reduce bias due to non-ignorable missing mechanism. The proposed nonparametric imputation method is robust to mis-specification of either one of the two working models and robust to mis-specification of the link function of the two working models. In contrast, the PMM method is sensitive to misspecification of the covariates included in imputation. The AIPW method is sensitive to the selection probability. We apply the approaches to a breast cancer dataset from Surveillance, Epidemiology and End Results (SEER) Program.

Keywords: Augmented inverse probability weighted method; Cox regression; missing covariates; multiple imputation; predictive mean matching.

PubMed Disclaimer

Conflict of interest statement

Declaration of conflicting interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Similar articles

Cited by

References

    1. Cox DR. Regression models and life-tables. J Royal Stat Soc Ser B (Methodological) 1972; 34: 187–220.
    1. Cox DR. Partial likelihood. Biometrika 1975; 62: 269–276.
    1. Andersen PK and Gill RD. Cox’s regression model for counting processes: a large sample study. Ann Stat 1982; 10: 1100–1120.
    1. Little RJA and Rubin DB. Statistical analysis with missing data, 2nd ed. New York, NY: Wiley, 2002.
    1. Rubin DB. Multiple imputation for nonresponse in surveys New York, NY: Wiley, 1987.

Publication types