Consensus clustering and functional interpretation of gene-expression data
- PMID: 15535870
- PMCID: PMC545785
- DOI: 10.1186/gb-2004-5-11-r94
Consensus clustering and functional interpretation of gene-expression data
Abstract
Microarray analysis using clustering algorithms can suffer from lack of inter-method consistency in assigning related gene-expression profiles to clusters. Obtaining a consensus set of clusters from a number of clustering methods should improve confidence in gene-expression analysis. Here we introduce consensus clustering, which provides such an advantage. When coupled with a statistically based gene functional analysis, our method allowed the identification of novel genes regulated by NFkappaB and the unfolded protein response in certain B-cell lymphomas.
Figures
References
-
- Ward JH. Hierarchical grouping to optimize an objective function. J Am Stat Assoc. 1963;58:236–244.
-
- McQueen J. 5th Berkeley Symposium on Mathematical Statistics and Probability. Berkeley; 1967. Some methods for classification and analysis of multivariate observations. pp. 281–297.
-
- Kaufman L, Rousseeuw PJ. Clustering by means of medoids. In: Dodge Y, editor. Statistical Analysis Based Upon the L1 Norm. Amsterdam: North-Holland; 1987. pp. 405–416.
-
- Goldstein D, Ghosh D, Conlon E. Statistical issues in the clustering of gene expression data. Stat Sin. 2002;12:219–241.
