Semi-supervised methods to predict patient survival from gene expression data
- PMID: 15094809
- PMCID: PMC387275
- DOI: 10.1371/journal.pbio.0020108
Semi-supervised methods to predict patient survival from gene expression data
Abstract
An important goal of DNA microarray research is to develop tools to diagnose cancer more accurately based on the genetic profile of a tumor. There are several existing techniques in the literature for performing this type of diagnosis. Unfortunately, most of these techniques assume that different subtypes of cancer are already known to exist. Their utility is limited when such subtypes have not been previously identified. Although methods for identifying such subtypes exist, these methods do not work well for all datasets. It would be desirable to develop a procedure to find such subtypes that is applicable in a wide variety of circumstances. Even if no information is known about possible subtypes of a certain form of cancer, clinical information about the patients, such as their survival time, is often available. In this study, we develop some procedures that utilize both the gene expression data and the clinical data to identify subtypes of cancer and use this knowledge to diagnose future patients. These procedures were successfully applied to several publicly available datasets. We present diagnostic procedures that accurately predict the survival of future patients based on the gene expression profile and survival times of previous patients. This has the potential to be a powerful tool for diagnosing and treating cancer.
Conflict of interest statement
The authors have declared that no conflicts of interest exist.
Figures






Similar articles
-
Semi-supervised recursively partitioned mixture models for identifying cancer subtypes.Bioinformatics. 2010 Oct 15;26(20):2578-85. doi: 10.1093/bioinformatics/btq470. Epub 2010 Aug 16. Bioinformatics. 2010. PMID: 20834038 Free PMC article.
-
An entropy-based gene selection method for cancer classification using microarray data.BMC Bioinformatics. 2005 Mar 24;6:76. doi: 10.1186/1471-2105-6-76. BMC Bioinformatics. 2005. PMID: 15790388 Free PMC article.
-
Cancer survival analysis using semi-supervised learning method based on Cox and AFT models with L1/2 regularization.BMC Med Genomics. 2016 Mar 1;9:11. doi: 10.1186/s12920-016-0169-6. BMC Med Genomics. 2016. PMID: 26932592 Free PMC article.
-
Transcriptomic signatures in breast cancer.Mol Biosyst. 2007 Jul;3(7):466-72. doi: 10.1039/b618163e. Epub 2007 Jun 5. Mol Biosyst. 2007. PMID: 17579771 Review.
-
Statistical challenges in preprocessing in microarray experiments in cancer.Clin Cancer Res. 2008 Oct 1;14(19):5959-66. doi: 10.1158/1078-0432.CCR-07-4532. Clin Cancer Res. 2008. PMID: 18829474 Free PMC article. Review.
Cited by
-
Cross-validation and Peeling Strategies for Survival Bump Hunting using Recursive Peeling Methods.Stat Anal Data Min. 2016 Feb;9(1):12-42. doi: 10.1002/sam.11301. Epub 2016 Jan 22. Stat Anal Data Min. 2016. PMID: 27034730 Free PMC article.
-
Improving ChIP-seq peak-calling for functional co-regulator binding by integrating multiple sources of biological information.BMC Genomics. 2012;13 Suppl 1(Suppl 1):S1. doi: 10.1186/1471-2164-13-S1-S1. Epub 2012 Jan 17. BMC Genomics. 2012. PMID: 22369349 Free PMC article.
-
A network-pathway based module identification for predicting the prognosis of ovarian cancer patients.J Ovarian Res. 2016 Nov 2;9(1):73. doi: 10.1186/s13048-016-0285-0. J Ovarian Res. 2016. PMID: 27806724 Free PMC article.
-
Differential gene expression and AKT targeting in triple negative breast cancer.Oncotarget. 2019 Jul 9;10(43):4356-4368. doi: 10.18632/oncotarget.27026. eCollection 2019 Jul 9. Oncotarget. 2019. PMID: 31320990 Free PMC article.
-
Innate gene repression associated with Mycobacterium bovis infection in cattle: toward a gene signature of disease.BMC Genomics. 2007 Oct 31;8:400. doi: 10.1186/1471-2164-8-400. BMC Genomics. 2007. PMID: 17974019 Free PMC article.
References
-
- Alizadeh AA, Eisen MB, Davis RE, Ma C, Lossos IS, et al. Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling. Nature. 2000;403:503–511. - PubMed
-
- Beer DG, Kardia SL, Huang CC, Giordano TJ, Levin AM, et al. Gene-expression profiles predict survival of patients with lung adenocarcinoma. Nat Med. 2002;8:816–824. - PubMed
-
- Ben-Dor A, Friedman N, Yakhini Z. Montreal (Quebec): ACM Press; 2001. Class discovery in gene expression data. Proceedings of the fifth annual international conference on computational biology; pp. 31–38. - DOI
-
- Bullinger L, Döhner K, Bair E, Fröhling S, Schlenk R, et al. Gene expression profiling identifies new subclasses and improves outcome prediction in adult myeloid leukemia. N Engl J Med. 2004 in press. - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources