Partial Cox regression analysis for high-dimensional microarray gene expression data
- PMID: 15262801
- DOI: 10.1093/bioinformatics/bth900
Abstract
Motivation: An important application of microarray technology is predicting clinical phenotypes from gene expression profiles. Success has been demonstrated in the molecular classification of cancer, in which different types of cancer serve as a categorical outcome variable. However, less research has linked gene expression profiles to censored survival outcomes such as patients' overall survival time or time to cancer relapse. In this paper, we develop a partial Cox regression method that constructs mutually uncorrelated components from microarray gene expression data for predicting the survival of future patients.
Results: The proposed partial Cox regression method constructs predictive components by repeated least squares fitting of residuals and Cox regression fitting. The key difference from the standard principal components of Cox regression analysis is that our method uses the observed survival/censoring information when constructing the predictive components. We also propose applying time-dependent receiver operating characteristic curve analysis to evaluate the results. We applied our methods to a publicly available dataset of diffuse large B-cell lymphoma. The results indicated that combining the partial Cox regression method with principal components analysis yields a parsimonious model with fewer components and better predictive performance. We conclude that the proposed partial Cox regression method can be very useful for building a parsimonious predictive model that accurately predicts the survival of future patients from the gene expression profiles and survival times of previous patients.
Availability: R code is available upon request.
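The abstract describes the component construction only in outline (repeated least squares fitting of residuals alternated with Cox regression fitting). The following is a minimal Python sketch of one way such a scheme can be arranged, not the authors' R code. Everything specific in it is an assumption made for illustration: the function names `cox_fit` and `partial_cox_components`, the equal weighting of genes when forming a component, the fixed-iteration Newton-Raphson fit with a crude step cap, and the synthetic toy data.

```python
import math
import random

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for c in range(n):
        piv = max(range(c, n), key=lambda r: abs(M[r][c]))
        M[c], M[piv] = M[piv], M[c]
        for r in range(c + 1, n):
            f = M[r][c] / M[c][c]
            for k in range(c, n + 1):
                M[r][k] -= f * M[c][k]
    x = [0.0] * n
    for r in reversed(range(n)):
        x[r] = (M[r][n] - sum(M[r][k] * x[k] for k in range(r + 1, n))) / M[r][r]
    return x

def cox_fit(time, event, cols, iters=15):
    """Maximize the Cox partial likelihood by Newton-Raphson (assumes no tied times)."""
    n, p = len(time), len(cols)
    beta = [0.0] * p
    for _ in range(iters):
        eta = [sum(beta[a] * cols[a][i] for a in range(p)) for i in range(n)]
        top = max(eta)  # shifting eta leaves the partial likelihood unchanged
        w = [math.exp(e - top) for e in eta]
        grad = [0.0] * p
        info = [[0.0] * p for _ in range(p)]
        for i in range(n):
            if not event[i]:
                continue  # censored subjects only enter risk sets
            risk = [k for k in range(n) if time[k] >= time[i]]
            s0 = sum(w[k] for k in risk)
            mean = [sum(w[k] * cols[a][k] for k in risk) / s0 for a in range(p)]
            for a in range(p):
                grad[a] += cols[a][i] - mean[a]
                for b in range(p):
                    s2 = sum(w[k] * cols[a][k] * cols[b][k] for k in risk) / s0
                    info[a][b] += s2 - mean[a] * mean[b]
        step = solve(info, grad)
        biggest = max(abs(s) for s in step)
        if biggest > 1.0:  # crude safeguard against overshooting (an assumption)
            step = [s / biggest for s in step]
        beta = [b + s for b, s in zip(beta, step)]
    return beta

def partial_cox_components(time, event, X, n_comp=2):
    """X is a list of centered gene columns; returns n_comp component columns."""
    V = [col[:] for col in X]  # current residual columns, one per gene
    comps = []
    for _ in range(n_comp):
        # Cox step: fit earlier components plus one gene's residual, keep the
        # coefficient of the residual (this uses the survival/censoring data).
        betas = [cox_fit(time, event, comps + [v])[-1] for v in V]
        # Equal gene weights are an assumption of this sketch.
        T = [sum(b * v[i] for b, v in zip(betas, V)) / len(V) for i in range(len(time))]
        comps.append(T)
        # Least squares step: replace each column by its residual after regressing on T.
        tt = dot(T, T)
        new_V = []
        for v in V:
            coef = dot(v, T) / tt
            new_V.append([vi - coef * ti for vi, ti in zip(v, T)])
        V = new_V
    return comps

# Synthetic toy data (not from the paper): 30 subjects, 5 "genes".
rng = random.Random(0)
n, p = 30, 5
X = [[rng.gauss(0, 1) for _ in range(n)] for _ in range(p)]
X = [[x - sum(col) / n for x in col] for col in X]  # center each gene
time = [rng.expovariate(math.exp(0.8 * X[0][i])) for i in range(n)]
event = [1 if rng.random() < 0.8 else 0 for _ in range(n)]
comps = partial_cox_components(time, event, X, n_comp=2)
```

Because every gene column is centered and each new component is built from residuals that are orthogonal to all earlier components, the resulting components are mutually uncorrelated by construction, which is the property the abstract emphasizes. In practice the number of components would be chosen by a validation criterion; this sketch fixes it at two.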
Similar articles
- Penalized Cox regression analysis in the high-dimensional and low-sample size settings, with applications to microarray gene expression data. Bioinformatics. 2005 Jul 1;21(13):3001-8. doi: 10.1093/bioinformatics/bti422. Epub 2005 Apr 6. PMID: 15814556
- Dimension reduction methods for microarrays with application to censored survival data. Bioinformatics. 2004 Dec 12;20(18):3406-12. doi: 10.1093/bioinformatics/bth415. Epub 2004 Jul 15. PMID: 15256406
- Boosting proportional hazards models using smoothing splines, with applications to high-dimensional microarray data. Bioinformatics. 2005 May 15;21(10):2403-9. doi: 10.1093/bioinformatics/bti324. Epub 2005 Feb 15. PMID: 15713732
- Cross-study analysis of gene expression data for intermediate neuroblastoma identifies two biological subtypes. BMC Cancer. 2007 May 25;7:89. doi: 10.1186/1471-2407-7-89. PMID: 17531100 Free PMC article. Review.
- Time-dependent covariates in the Cox proportional-hazards regression model. Annu Rev Public Health. 1999;20:145-57. doi: 10.1146/annurev.publhealth.20.1.145. PMID: 10352854 Review.
Cited by
- Network-based survival analysis reveals subnetwork signatures for predicting outcomes of ovarian cancer treatment. PLoS Comput Biol. 2013;9(3):e1002975. doi: 10.1371/journal.pcbi.1002975. Epub 2013 Mar 21. PMID: 23555212 Free PMC article.
- Gene-expression signature predicts postoperative recurrence in stage I non-small cell lung cancer patients. PLoS One. 2012;7(1):e30880. doi: 10.1371/journal.pone.0030880. Epub 2012 Jan 23. PMID: 22292069 Free PMC article.
- JCDSA: a joint covariate detection tool for survival analysis on tumor expression profiles. BMC Bioinformatics. 2018 May 29;19(1):187. doi: 10.1186/s12859-018-2213-3. PMID: 29843599 Free PMC article.
- XRN2 promotes EMT and metastasis through regulating maturation of miR-10a. Oncogene. 2017 Jul 6;36(27):3925-3933. doi: 10.1038/onc.2017.39. Epub 2017 Mar 20. PMID: 28319071
- Genetic co-expression networks contribute to creating predictive model and exploring novel biomarkers for the prognosis of breast cancer. Sci Rep. 2021 Mar 31;11(1):7268. doi: 10.1038/s41598-021-84995-z. PMID: 33790307 Free PMC article.