Clustering of significant genes in prognostic studies with microarrays: application to a clinical study for multiple myeloma
- PMID: 17680552
- DOI: 10.1002/sim.2997
Abstract
When a large number of genes are significant in correlating microarray gene expression data with patient prognosis, clustering of significant genes may be effective not only for further dimension reduction but also for identifying co-regulated genes that belong to the same molecular pathway related to disease biology and aggressiveness. Moreover, a reduced feature, such as the average expression across samples for a cluster of significant genes, can play an important role in reducing variance in prediction analysis. We propose a simple procedure to select gene clusters that have strong marginal association with survival outcome from a large pool of candidate hierarchical clusters of significant genes. Selected gene clusters can have better predictive capability than the other gene clusters and singleton genes. Application of such clustering to the data set from a clinical study for patients with multiple myeloma and associated microarrays is given.
Copyright (c) 2007 John Wiley & Sons, Ltd.
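The selection step described in the abstract can be illustrated with a minimal sketch. It assumes that each candidate cluster is summarized by its per-sample average expression and that marginal association with survival is measured by a univariate Cox score test statistic. The abstract does not state which statistic the authors use, so that choice is an assumption, as are the function names (`cox_score_z`, `cluster_average`, `rank_clusters`) and the toy data. The candidate clusters are passed in directly here; in practice they would be the nodes of a hierarchical clustering of the significant genes.

```python
import math

def cox_score_z(x, time, event):
    """Univariate Cox score test statistic at beta = 0 (assumed measure of
    marginal association; ties are handled Breslow-style)."""
    u = 0.0  # score: sum over events of (x_i - risk-set mean)
    v = 0.0  # information: sum over events of risk-set variance
    for i, (t_i, d_i) in enumerate(zip(time, event)):
        if not d_i:
            continue  # censored observations contribute only via risk sets
        risk = [x[j] for j in range(len(x)) if time[j] >= t_i]
        mean = sum(risk) / len(risk)
        var = sum((r - mean) ** 2 for r in risk) / len(risk)
        u += x[i] - mean
        v += var
    return u / math.sqrt(v) if v > 0 else 0.0

def cluster_average(expr, cluster):
    """Per-sample mean expression over the genes in one cluster.
    expr[g][s] is the expression of gene g in sample s."""
    n_samples = len(expr[0])
    return [sum(expr[g][s] for g in cluster) / len(cluster)
            for s in range(n_samples)]

def rank_clusters(expr, candidates, time, event):
    """Score every candidate cluster and sort by |z|, strongest first."""
    scored = []
    for cluster in candidates:
        z = cox_score_z(cluster_average(expr, cluster), time, event)
        scored.append((abs(z), cluster))
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored

# Hypothetical toy data: 3 genes x 6 patients, all deaths observed.
expr = [
    [6.0, 5.0, 4.0, 3.0, 2.0, 1.0],  # gene 0: high expression, early death
    [5.5, 5.2, 4.1, 2.9, 2.2, 0.8],  # gene 1: co-regulated with gene 0
    [3.0, 1.0, 4.0, 1.0, 5.0, 2.0],  # gene 2: unrelated to survival
]
time = [1, 2, 3, 4, 5, 6]
event = [1, 1, 1, 1, 1, 1]

# Candidate pool: singletons plus two nested clusters, as a dendrogram would give.
candidates = [[0], [1], [2], [0, 1], [0, 1, 2]]
ranked = rank_clusters(expr, candidates, time, event)
for score, cluster in ranked:
    print(cluster, round(score, 2))
```

Clusters at the top of `ranked` would be the ones carried forward as reduced features. A real analysis would also need to account for the many candidates being compared, which this sketch does not attempt.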