Variational Bayes for high-dimensional proportional hazards models with applications within gene expression
- PMID: 35751586
- PMCID: PMC9364383
- DOI: 10.1093/bioinformatics/btac416
Variational Bayes for high-dimensional proportional hazards models with applications within gene expression
Abstract
Motivation: Few Bayesian methods for analyzing high-dimensional sparse survival data provide scalable variable selection, effect estimation and uncertainty quantification. Such methods often either sacrifice uncertainty quantification by computing maximum a posteriori estimates, or quantify the uncertainty at high (unscalable) computational expense.
Results: We bridge this gap and develop an interpretable and scalable Bayesian proportional hazards model for prediction and variable selection, referred to as sparse variational Bayes. Our method, based on a mean-field variational approximation, overcomes the high computational cost of Markov chain Monte Carlo, whilst retaining useful features, providing a posterior distribution for the parameters and offering a natural mechanism for variable selection via posterior inclusion probabilities. The performance of our proposed method is assessed via extensive simulations and compared against other state-of-the-art Bayesian variable selection methods, demonstrating comparable or better performance. Finally, we demonstrate how the proposed method can be used for variable selection on two transcriptomic datasets with censored survival outcomes, and how the uncertainty quantification offered by our method can be used to provide an interpretable assessment of patient risk.
Availability and implementation: our method has been implemented as a freely available R package survival.svb (https://github.com/mkomod/survival.svb).
Supplementary information: Supplementary data are available at Bioinformatics online.
© The Author(s) 2022. Published by Oxford University Press.
Figures


Similar articles
-
Implementation of a practical Markov chain Monte Carlo sampling algorithm in PyBioNetFit.Bioinformatics. 2022 Mar 4;38(6):1770-1772. doi: 10.1093/bioinformatics/btac004. Bioinformatics. 2022. PMID: 34986226 Free PMC article.
-
Bayesian uncertainty quantification and propagation in molecular dynamics simulations: a high performance computing framework.J Chem Phys. 2012 Oct 14;137(14):144103. doi: 10.1063/1.4757266. J Chem Phys. 2012. PMID: 23061835
-
The spike-and-slab lasso Cox model for survival prediction and associated genes detection.Bioinformatics. 2017 Sep 15;33(18):2799-2807. doi: 10.1093/bioinformatics/btx300. Bioinformatics. 2017. PMID: 28472220 Free PMC article.
-
A simple approach to fitting Bayesian survival models.Lifetime Data Anal. 2003 Mar;9(1):5-19. doi: 10.1023/a:1021821002693. Lifetime Data Anal. 2003. PMID: 12602771 Review.
-
High-dimensional genomic feature selection with the ordered stereotype logit model.Brief Bioinform. 2022 Nov 19;23(6):bbac414. doi: 10.1093/bib/bbac414. Brief Bioinform. 2022. PMID: 36184192 Free PMC article. Review.
Cited by
-
Adaptive MCMC for Bayesian Variable Selection in Generalised Linear Models and Survival Models.Entropy (Basel). 2023 Sep 8;25(9):1310. doi: 10.3390/e25091310. Entropy (Basel). 2023. PMID: 37761609 Free PMC article.
-
The spike-and-slab quantile LASSO for robust variable selection in cancer genomics studies.Stat Med. 2024 Nov 20;43(26):4928-4983. doi: 10.1002/sim.10196. Epub 2024 Sep 11. Stat Med. 2024. PMID: 39260448
-
BAYESIAN VARIABLE SELECTION IN A COX PROPORTIONAL HAZARDS MODEL WITH THE "SUM OF SINGLE EFFECTS" PRIOR.ArXiv [Preprint]. 2025 Jun 6:arXiv:2506.06233v1. ArXiv. 2025. PMID: 40503016 Free PMC article. Preprint.
References
-
- Antoniadis A. et al. (2010) The dantzig selector in cox’s proportional hazards model. Scand. J. Stat., 37, 531–552.
-
- Bai R. et al. (2021) Spike-and-Slab Meets LASSO: A Review of the Spike-and-Slab LASSO. arXiv preprint arXiv:2010.06451, May 2021.
-
- Banerjee S. et al. (2021) Bayesian Inference in High-Dimensional Models. arXiv preprint arXiv:2101.04491, Jan 2021.
-
- Bhadra A. et al. (2019) Lasso meets horseshoe: a survey. Stat. Sci., 34, 405–427.
-
- Blei D.M., Lafferty J.D. (2007) A correlated topic model of science. Ann. Appl. Stat., 1, 17–35.
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Miscellaneous