Scalable algorithms for semiparametric accelerated failure time models in high dimensions
- PMID: 35014701
- DOI: 10.1002/sim.9264
Scalable algorithms for semiparametric accelerated failure time models in high dimensions
Abstract
Semiparametric accelerated failure time (AFT) models are a useful alternative to Cox proportional hazards models, especially when the assumption of constant hazard ratios is untenable. However, rank-based criteria for fitting AFT models are often nondifferentiable, which poses a computational challenge in high-dimensional settings. In this article, we propose a new alternating direction method of multipliers algorithm for fitting semiparametric AFT models by minimizing a penalized rank-based loss function. Our algorithm scales well in both the number of subjects and number of predictors, and can easily accommodate a wide range of popular penalties. To improve the selection of tuning parameters, we propose a new criterion which avoids some common problems in cross-validation with censored responses. Through extensive simulation studies, we show that our algorithm and software is much faster than existing methods (which can only be applied to special cases), and we show that estimators which minimize a penalized rank-based criterion often outperform alternative estimators which minimize penalized weighted least squares criteria. Application to nine cancer datasets further demonstrates that rank-based estimators of semiparametric AFT models are competitive with estimators assuming proportional hazards in high-dimensional settings, whereas weighted least squares estimators are often not. A software package implementing the algorithm, along with a set of auxiliary functions, is available for download at github.com/ajmolstad/penAFT.
Keywords: Gehan estimator; accelerated failure time model; bilevel variable selection; convex optimization; semiparametrics; survival analysis.
© 2022 John Wiley & Sons, Ltd.
Similar articles
-
Deep Neural Network-Based Accelerated Failure Time Models Using Rank Loss.Stat Med. 2024 Dec 10;43(28):5331-5343. doi: 10.1002/sim.10235. Epub 2024 Oct 12. Stat Med. 2024. PMID: 39394866
-
Accelerated failure time modeling via nonparametric mixtures.Biometrics. 2023 Mar;79(1):165-177. doi: 10.1111/biom.13556. Epub 2021 Sep 20. Biometrics. 2023. PMID: 34480750
-
On semiparametric accelerated failure time models with time-varying covariates: A maximum penalised likelihood estimation.Stat Med. 2023 Dec 30;42(30):5577-5595. doi: 10.1002/sim.9926. Epub 2023 Oct 16. Stat Med. 2023. PMID: 37845791
-
Flexible boosting of accelerated failure time models.BMC Bioinformatics. 2008 Jun 6;9:269. doi: 10.1186/1471-2105-9-269. BMC Bioinformatics. 2008. PMID: 18538026 Free PMC article.
-
A new estimation method for the semiparametric accelerated failure time mixture cure model.Stat Med. 2007 Jul 20;26(16):3157-71. doi: 10.1002/sim.2748. Stat Med. 2007. PMID: 17094075
Cited by
-
sparsesurv: a Python package for fitting sparse survival models via knowledge distillation.Bioinformatics. 2024 Sep 2;40(9):btae521. doi: 10.1093/bioinformatics/btae521. Bioinformatics. 2024. PMID: 39177085 Free PMC article.
-
A Multi-Omics Framework for Survival Mediation Analysis of High-Dimensional Proteogenomic Data.ArXiv [Preprint]. 2025 Mar 11:arXiv:2503.08606v1. ArXiv. 2025. PMID: 40160448 Free PMC article. Preprint.
References
REFERENCES
-
- Wei LJ. The accelerated failure time model: a useful alternative to the Cox regression model in survival analysis. Stat Med. 1992;11(14-15):1871-1879.
-
- Kalbfleisch JD, Prentice RL. The Statistical Analysis of Failure Time Data. Vol 360. Hoboken, NJ: John Wiley & Sons; 2011.
-
- Zhou M. M-estimation in censored linear models. Biometrika. 1992;79(4):837-841.
-
- Stute W. Consistent estimation under random censorship when covariables are present. J Multivar Anal. 1993;45(1):89-103.
-
- Stute W. Distributional convergence under random censorship when covariables are present. Scand J Stat. 1996;23(4):461-471.