Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017;112(519):1221-1235.
doi: 10.1080/01621459.2016.1205500. Epub 2017 Apr 25.

Estimation in the semiparametric accelerated failure time model with missing covariates: improving efficiency through augmentation

Affiliations

Estimation in the semiparametric accelerated failure time model with missing covariates: improving efficiency through augmentation

Jon Arni Steingrimsson et al. J Am Stat Assoc. 2017.

Abstract

This paper considers linear regression with missing covariates and a right censored outcome. We first consider a general two-phase outcome sampling design, where full covariate information is only ascertained for subjects in phase two and sampling occurs under an independent Bernoulli sampling scheme with known subject-specific sampling probabilities that depend on phase one information (e.g., survival time, failure status and covariates). The semiparametric information bound is derived for estimating the regression parameter in this setting. We also introduce a more practical class of augmented estimators that is shown to improve asymptotic efficiency over simple but inefficient inverse probability of sampling weighted estimators. Estimation for known sampling weights and extensions to the case of estimated sampling weights are both considered. The allowance for estimated sampling weights permits covariates to be missing at random according to a monotone but unknown mechanism. The asymptotic properties of the augmented estimators are derived and simulation results demonstrate substantial efficiency improvements over simpler inverse probability of sampling weighted estimators in the indicated settings. With suitable modification, the proposed methodology can also be used to improve augmented estimators previously used for missing covariates in a Cox regression model.

PubMed Disclaimer

Figures

Figure 1:
Figure 1:
Relative efficiency of β^ALR, β^HT,γ^, and β^ALR,γ^ compared to β^HT for both the Gehan and the Log-Rank weight in the case of a 70% censoring rate; values exceeding 1.0 imply β^HT is less efficient. The first row shows the result when using the correct model for the conditional expectations and the second row corresponds to the misspecified model. The first column shows the efficiency gains for the phase one coefficient β1 and the second column shows the efficiency gains for the phase two coefficient β2. In the legend Aug denotes an augmented estimator, EW denotes that estimated sampling probabilities were used, and Gehan and LR denotes the Gehan and Log-Rank weight. The labels on the x-axis represent the six different distributions used: SN = N(0,1), MN = 0.95 N(0,1) + 0.05 N(0,9), HN = 0.5 N(0,1) + 0.5 N(0,9), W1 = Weibull(2,1), W2 = Weibull(0.5,1), EV = Extreme Value.

Similar articles

Cited by

References

    1. Borgan O, Langholz B, Samuelsen SO, Goldstein L, and Pogoda J “Exposure stratified case-cohort designs.” Lifetime Data Analysis, 6(1):39–58 (2000). - PubMed
    1. Breslow NE, Hu J, and Wellner JA “Z-estimation and stratified samples: application to survival models.” Lifetime data analysis, 1–24 (2015). - PMC - PubMed
    1. Breslow NE, Lumley T, Ballantyne CM, Chambless LE, and Kulich M “Improved Horvitz–Thompson estimation of model parameters from two-phase stratified samples: applications in epidemiology.” Statistics in Biosciences, 1(1):32–49 (2009). - PMC - PubMed
    1. Breslow NE and Wellner JA “Weighted likelihood for semiparametric models and two-phase stratified samples, with application to Cox regression.” Scandinavian Journal of Statistics, 34(1):86–102 (2007). - PMC - PubMed
    1. Buckley J and James I “Linear regression with censored data.” Biometrika, 66(3):429–436 (1979).

LinkOut - more resources