Estimation of Heterogeneous Restricted Mean Survival Time Using Random Forest
- PMID: 33584791
- PMCID: PMC7873855
- DOI: 10.3389/fgene.2020.587378
Estimation of Heterogeneous Restricted Mean Survival Time Using Random Forest
Abstract
Estimation and prediction of heterogeneous restricted mean survival time (hRMST) is of great clinical importance, which can provide an easily interpretable and clinically meaningful summary of the survival function in the presence of censoring and individual covariates. The existing methods for the modeling of hRMST rely on proportional hazards or other parametric assumptions on the survival distribution. In this paper, we propose a random forest based estimation of hRMST for right-censored survival data with covariates and prove a central limit theorem for the resulting estimator. In addition, we present a computationally efficient construction for the confidence interval of hRMST. Our simulations show that the resulting confidence intervals have the correct coverage probability of the hRMST, and the random forest based estimate of hRMST has smaller prediction errors than the parametric models when the models are mis-specified. We apply the method to the ovarian cancer data set from The Cancer Genome Atlas (TCGA) project to predict hRMST and show an improved prediction performance over the existing methods. A software implementation, srf using R and C++, is available at https://github.com/lmy1019/SRF.
Keywords: estimating equation; high dimensional data; inference; non-parametric survival estimation; regression forest.
Copyright © 2021 Liu and Li.
Conflict of interest statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Figures





Similar articles
-
Collaborative double robust targeted maximum likelihood estimation.Int J Biostat. 2010 May 17;6(1):Article 17. doi: 10.2202/1557-4679.1181. Int J Biostat. 2010. PMID: 20628637 Free PMC article.
-
Model selection for survival individualized treatment rules using the jackknife estimator.BMC Med Res Methodol. 2022 Dec 22;22(1):328. doi: 10.1186/s12874-022-01811-6. BMC Med Res Methodol. 2022. PMID: 36550398 Free PMC article.
-
Estimation with interval censored data and covariates.Lifetime Data Anal. 1997;3(1):77-91. doi: 10.1023/a:1009620319159. Lifetime Data Anal. 1997. PMID: 9384627
-
Ensemble methods for survival function estimation with time-varying covariates.Stat Methods Med Res. 2022 Nov;31(11):2217-2236. doi: 10.1177/09622802221111549. Epub 2022 Jul 27. Stat Methods Med Res. 2022. PMID: 35895510 Review.
-
Large-scale benchmark study of survival prediction methods using multi-omics data.Brief Bioinform. 2021 May 20;22(3):bbaa167. doi: 10.1093/bib/bbaa167. Brief Bioinform. 2021. PMID: 32823283 Free PMC article. Review.
Cited by
-
Improve individual treatment by comparing treatment benefits: cancer artificial intelligence survival analysis system for cervical carcinoma.J Transl Med. 2022 Jun 28;20(1):293. doi: 10.1186/s12967-022-03491-8. J Transl Med. 2022. PMID: 35765031 Free PMC article.
-
Differences of survival benefits brought by various treatments in ovarian cancer patients with different tumor stages.J Ovarian Res. 2023 May 11;16(1):92. doi: 10.1186/s13048-023-01173-7. J Ovarian Res. 2023. PMID: 37170143 Free PMC article.
References
-
- Andersen P. K., Gill R. D. (1982). Cox's regression model for counting processes: a large sample study. Ann. Stat. 10, 1100–1120. 10.1214/aos/1176345976 - DOI
-
- Athey S., Tibshirani J., Wager S. (2018). Generalized Random Forests. Technical report. Stanford, CA: Stanford University.
-
- Biau G. (2012). Analysis of a random forests model. J. Mach. Learn. Res. 13, 1063–1095.
-
- Biau G., Devroye L., Lugosi G. (2008). Consistency of random forests and other averaging classifiers. J. Mach. Learn. Res. 9, 2015–2033.
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Miscellaneous