. 2022 Jul 30;41(17):3398-3420.

doi: 10.1002/sim.9424. Epub 2022 May 17.

Penalized weighted proportional hazards model for robust variable selection and outlier detection

Bin Luo¹, Xiaoli Gao², Susan Halabi¹

Affiliations

¹ Department of Biostatistics and Bioinformatics, Duke University, Durham, North Carolina, USA.
² Department of Mathematics and Statistics, The University of North Carolina at Greensboro, Greensboro, North Carolina, USA.

PMID: 35581736
PMCID: PMC9283382
DOI: 10.1002/sim.9424

Penalized weighted proportional hazards model for robust variable selection and outlier detection

Bin Luo et al. Stat Med. 2022.

. 2022 Jul 30;41(17):3398-3420.

doi: 10.1002/sim.9424. Epub 2022 May 17.

Authors

Bin Luo¹, Xiaoli Gao², Susan Halabi¹

Affiliations

¹ Department of Biostatistics and Bioinformatics, Duke University, Durham, North Carolina, USA.
² Department of Mathematics and Statistics, The University of North Carolina at Greensboro, Greensboro, North Carolina, USA.

PMID: 35581736
PMCID: PMC9283382
DOI: 10.1002/sim.9424

Abstract

Identifying exceptional responders or nonresponders is an area of increased research interest in precision medicine as these patients may have different biological or molecular features and therefore may respond differently to therapies. Our motivation stems from a real example from a clinical trial where we are interested in characterizing exceptional prostate cancer responders. We investigate the outlier detection and robust regression problem in the sparse proportional hazards model for censored survival outcomes. The main idea is to model the irregularity of each observation by assigning an individual weight to the hazard function. By applying a LASSO-type penalty on both the model parameters and the log transformation of the weight vector, our proposed method is able to perform variable selection and outlier detection simultaneously. The optimization problem can be transformed to a typical penalized maximum partial likelihood problem and thus it is easy to implement. We further extend the proposed method to deal with the potential outlier masking problem caused by censored outcomes. The performance of the proposed estimator is demonstrated with extensive simulation studies and real data analyses in low-dimensional and high-dimensional settings.

Keywords: censoring; high-dimensional data; outlier detection; proportional hazards model; robust estimation; time-to-event outcomes; variable selection.

PubMed Disclaimer

Figures

**FIGURE A1**
Mean squared errors (MSE) of the standard Cox estimator, the oracle Cox estimator, the robust Cox estimator, the vanilla PAWPH and the PAWPH for β = (1, 2, −1)^T in scenario (a).

**FIGURE A2**
Outlier detection results from the proposed PAWPH estimator for β = (1, 2, −1)^T in scenario (a). The masking probability for outliers with $0 < w < 1 (M -)$ , the overall masking probability $(M)$ , and the swamping probability $(S)$ are plotted in each row, respectively.

**FIGURE A3**
Mean squared errors (MSE) of the Cox-ALASSO estimator, the oracle Cox-ALASSO estimator, the vanilla PAWPH and the PAWPH for β = (1, 2, −1, 0, 0, 0, 0, 0)^T in scenario (b).

**FIGURE A4**
Mean squared errors (MSE) of the Cox-ALASSO estimator, the oracle Cox-ALASSO estimator, the vanilla PAWPH and the PAWPH for high-dimensional scenario (c).

**FIGURE A5**
Mean squared errors (MSE) of the Cox-ALASSO estimator, the oracle Cox-ALASSO estimator, the vanilla PAWPH and the PAWPH for high-dimensional scenario (d).

**FIGURE A6**
Correctly fitted ratio (CFR) of the Cox-ALASSO estimator, the oracle Cox-ALASSO estimator, the vanilla PAWPH and the PAWPH for β = (1, 2, −1, 0, 0, 0, 0, 0)^T in scenario (b).

**FIGURE A7**
Correctly fitted ratio (CFR) of the Cox-ALASSO estimator, the oracle Cox-ALASSO estimator, the vanilla PAWPH and the PAWPH for high-dimensional scenario (c).

**FIGURE A8**
Outlier detection results from the proposed PAWPH estimator for β = (1, 2, −1, 0, 0, 0, 0, 0)^T in scenario (b). The masking probability for outliers with $0 < w < 1 (M -)$ , the overall masking probability $(M)$ , and the swamping probability $(S)$ are plotted in each row, respectively.

**FIGURE A9**
Outlier detection results from the proposed PAWPH estimator for high-dimensional scenario (c). The masking probability for outliers with $0 < w < 1 (M -)$ , the overall masking probability $(M)$ , and the swamping probability $(S)$ are plotted in each row, respectively.

**FIGURE A10**
Outlier detection results from the proposed PAWPH estimator for high-dimensional scenario (d). The masking probability for outliers with $0 < w < 1 (M -)$ , the overall masking probability $(M)$ , and the swamping probability $(S)$ are plotted in each row, respectively.

**FIGURE 1**
Mean squared errors (MSE) of the standard Cox estimator, the oracle Cox estimator, the vanilla PAWPH and the PAWPH for scenarios (b) p = 8 and (c) p = 1000.

**FIGURE 2**
Correctly fitted ratio (CFR) of the Cox-ALASSO estimator, the oracle Cox-ALASSO estimator, the vanilla PAWPH and the PAWPH for scenarios (b) p = 8 and (c) p = 1000.

**FIGURE 3**
Outlier detection results from the proposed PAWPH estimator for scenarios (b) p = 8 and (c) p = 1000. The masking probability for outliers with $0 < w < 1 (M -)$ , the overall masking probability $(M)$ , and the swamping probability $(S)$ are plotted in each row, respectively.

**Figure 4a**
Deviance residual plots and outlier detection of the PAWPH with p = 23.

**FIGURE 4b**
Deviance residual plots and outlier detection of the PAWPH with p = 109.

**FIGURE 5**
Survival distribution by detected outliers and normal observations for (a) p = 23 and (b) p = 109.

**FIGURE 6a**
Boxplots of tAUROCs from the testing sets over 100 splits with p = 23. The three panels correspond to 0%, 5% and 10% synthetic contamination on the training sets for each split.

**FIGURE 6b**
Boxplots of tAUROCs from the testing sets over 100 splits with p = 109. The three panels correspond to 0%, 5% and 10% synthetic contamination on the training sets for each split.

See this image and copyright information in PMC

Cited by

Robust variable selection methods with Cox model-a selective practical benchmark study.
Zhang Y, Muller S. Zhang Y, et al. Brief Bioinform. 2024 Sep 23;25(6):bbae508. doi: 10.1093/bib/bbae508. Brief Bioinform. 2024. PMID: 39400113 Free PMC article. Review.
Maternal behaviors influence survival of ungulate neonates under heavy predation risk.
Muthersbaugh MS, Boone WW, Saldo EA, Jensen AJ, Cantrell J, Ruth C, Kilgo JC, Jachowski DS. Muthersbaugh MS, et al. Ecol Evol. 2024 Aug 21;14(8):e70151. doi: 10.1002/ece3.70151. eCollection 2024 Aug. Ecol Evol. 2024. PMID: 39170052 Free PMC article.

References

1. Cox DR. Regression models and life-tables. Journal of the Royal Statistical Society: Series B (Methodological) 1972; 34(2): 187–202.
1. Bednarski T. On sensitivity of Cox’s estimator. Statistics & Risk Modeling 1989; 7(3): 215–228.
1. Minder CE, Bednarski T. A robust method for proportional hazards regression. Statistics in Medicine 1996; 15(10): 1033–1047. - PubMed
1. Valsecchi M, Silvestri D, Sasieni P. Evaluation of long-term survival: use of diagnostics and robust estimators with Cox’s proportional hazards model. Statistics in medicine 1996; 15(24): 2763–2780. - PubMed
1. Cain KC, Lange NT. Approximate case influence for the proportional hazards regression model with censored data. Biometrics 1984: 493–499. - PubMed

Publication types

Actions
Actions
Actions

MeSH terms

Actions
Actions
Actions

Grants and funding

LinkOut - more resources

Full Text Sources
- Europe PubMed Central
- PubMed Central

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Penalized weighted proportional hazards model for robust variable selection and outlier detection

Affiliations

Penalized weighted proportional hazards model for robust variable selection and outlier detection

Authors

Affiliations

Abstract

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources