Estimation in the semiparametric accelerated failure time model with missing covariates: improving efficiency through augmentation
- PMID: 33033419
- PMCID: PMC7540935
- DOI: 10.1080/01621459.2016.1205500
Estimation in the semiparametric accelerated failure time model with missing covariates: improving efficiency through augmentation
Abstract
This paper considers linear regression with missing covariates and a right censored outcome. We first consider a general two-phase outcome sampling design, where full covariate information is only ascertained for subjects in phase two and sampling occurs under an independent Bernoulli sampling scheme with known subject-specific sampling probabilities that depend on phase one information (e.g., survival time, failure status and covariates). The semiparametric information bound is derived for estimating the regression parameter in this setting. We also introduce a more practical class of augmented estimators that is shown to improve asymptotic efficiency over simple but inefficient inverse probability of sampling weighted estimators. Estimation for known sampling weights and extensions to the case of estimated sampling weights are both considered. The allowance for estimated sampling weights permits covariates to be missing at random according to a monotone but unknown mechanism. The asymptotic properties of the augmented estimators are derived and simulation results demonstrate substantial efficiency improvements over simpler inverse probability of sampling weighted estimators in the indicated settings. With suitable modification, the proposed methodology can also be used to improve augmented estimators previously used for missing covariates in a Cox regression model.
Figures

Similar articles
-
Analysis of two-phase sampling data with semiparametric additive hazards models.Lifetime Data Anal. 2017 Jul;23(3):377-399. doi: 10.1007/s10985-016-9363-2. Epub 2016 Mar 19. Lifetime Data Anal. 2017. PMID: 26995733 Free PMC article.
-
Best linear inverse probability weighted estimation for two-phase designs and missing covariate regression.Stat Med. 2019 Jul 10;38(15):2783-2796. doi: 10.1002/sim.8141. Epub 2019 Mar 25. Stat Med. 2019. PMID: 30908669 Free PMC article.
-
Pseudo-partial likelihood estimators for the Cox regression model with missing covariates.Biometrika. 2009 Sep;96(3):617-633. doi: 10.1093/biomet/asp027. Epub 2009 Jun 22. Biometrika. 2009. PMID: 23946546 Free PMC article.
-
Analysis of case-cohort designs with binary outcomes: Improving efficiency using whole-cohort auxiliary information.Stat Methods Med Res. 2017 Apr;26(2):691-706. doi: 10.1177/0962280214556175. Epub 2014 Oct 26. Stat Methods Med Res. 2017. PMID: 25348675 Review.
-
Recent progresses in outcome-dependent sampling with failure time data.Lifetime Data Anal. 2017 Jan;23(1):57-82. doi: 10.1007/s10985-015-9355-7. Epub 2016 Jan 13. Lifetime Data Anal. 2017. PMID: 26759313 Free PMC article. Review.
Cited by
-
Efficient estimation for left-truncated competing risks regression for case-cohort studies.Biometrics. 2024 Jan 29;80(1):ujad008. doi: 10.1093/biomtc/ujad008. Biometrics. 2024. PMID: 38281769 Free PMC article.
-
Regularized Buckley-James method for right-censored outcomes with block-missing multimodal covariates.Stat (Int Stat Inst). 2022 Dec;11(1):e515. doi: 10.1002/sta4.515. Epub 2022 Oct 13. Stat (Int Stat Inst). 2022. PMID: 37854542 Free PMC article.
References
-
- Borgan O, Langholz B, Samuelsen SO, Goldstein L, and Pogoda J “Exposure stratified case-cohort designs.” Lifetime Data Analysis, 6(1):39–58 (2000). - PubMed
-
- Buckley J and James I “Linear regression with censored data.” Biometrika, 66(3):429–436 (1979).
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources