Linear modes of gene expression determined by independent component analysis
- PMID: 11836211
- DOI: 10.1093/bioinformatics/18.1.51
Abstract
Motivation: The expression of genes is controlled by specific combinations of cellular variables. We applied Independent Component Analysis (ICA) to gene expression data, deriving a linear model based on hidden variables, which we term 'expression modes'. The expression of each gene is a linear function of the expression modes, where, according to the ICA model, the linear influences of different modes show a minimal statistical dependence, and their distributions deviate sharply from the normal distribution.
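The linear model described above can be illustrated with a small sketch. This is not the authors' code and it does not estimate anything by ICA; it only builds synthetic data under the stated model, in which each gene's expression is a weighted sum of mode profiles, and measures how far a set of influences departs from a normal distribution using excess kurtosis. The function names, the toy mode profiles, and the choice of a Laplace distribution for the influences are assumptions made for illustration.

```python
import random


def expression_from_modes(influences, modes):
    """Return X[g][s] = sum over k of influences[g][k] * modes[k][s].

    influences: one row per gene, one column per mode.
    modes:      one row per mode, one column per sample.
    """
    n_samples = len(modes[0])
    return [
        [sum(a * mode[s] for a, mode in zip(row, modes)) for s in range(n_samples)]
        for row in influences
    ]


def excess_kurtosis(values):
    """Fourth standardized moment minus 3 (zero for a normal distribution)."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    m4 = sum((v - mean) ** 4 for v in values) / n
    return m4 / (m2 ** 2) - 3.0


# Synthetic example (all values hypothetical).
rng = random.Random(0)
n_genes = 2000

# Two toy mode profiles over four samples.
modes = [
    [1.0, 0.5, -0.5, -1.0],
    [0.0, 1.0, 1.0, 0.0],
]

# Heavy-tailed (Laplace-like) influences: random sign times an exponential draw.
influences = [
    [rng.choice((-1.0, 1.0)) * rng.expovariate(1.0) for _ in modes]
    for _ in range(n_genes)
]

expression = expression_from_modes(influences, modes)
print(len(expression), "genes x", len(expression[0]), "samples")
print("excess kurtosis of mode 0 influences:",
      excess_kurtosis([row[0] for row in influences]))
```

A clearly positive excess kurtosis for the influences is the kind of deviation from normality that the ICA model assumes. In practice the influences and modes would be estimated from measured expression data by an ICA algorithm rather than generated as they are here.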
Results: Studying cell cycle-related gene expression in yeast, we found that the dominant expression modes could be related to distinct biological functions, such as phases of the cell cycle or the mating response. Analysis of human lymphocytes revealed modes that were related to characteristic differences between cell types. With both data sets, the linear influences of the dominant modes showed distributions with large tails, indicating the existence of specifically up- and downregulated target genes. The expression modes and their influences can be used to visualize the samples and genes in low-dimensional spaces. A projection to expression modes helps to highlight particular biological functions, to reduce noise, and to compress the data in a biologically sensible way.
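Two of the uses mentioned above, finding genes in the tails of a mode's influence distribution and compressing the data by keeping only selected modes, can be sketched as follows. This is a minimal illustration under assumptions, not the procedure used in the paper: the three-standard-deviation cutoff, the function names, and the toy numbers are all hypothetical. Because the sign of an ICA component is arbitrary, labelling one tail "up" and the other "down" is only a convention.

```python
def tail_genes(influence_column, n_sd=3.0):
    """Return (up, down) gene indices whose influence lies more than
    n_sd standard deviations above or below the mean influence."""
    n = len(influence_column)
    mean = sum(influence_column) / n
    sd = (sum((v - mean) ** 2 for v in influence_column) / n) ** 0.5
    up = [g for g, v in enumerate(influence_column) if v - mean > n_sd * sd]
    down = [g for g, v in enumerate(influence_column) if mean - v > n_sd * sd]
    return up, down


def reconstruct(influences, modes, keep):
    """Rebuild the expression matrix using only the modes listed in `keep`.

    influences: one row per gene, one column per mode.
    modes:      one row per mode, one column per sample.
    keep:       indices of the modes to retain.
    """
    n_samples = len(modes[0])
    return [
        [sum(row[k] * modes[k][s] for k in keep) for s in range(n_samples)]
        for row in influences
    ]


# Toy data (hypothetical): 22 genes, 2 modes, 3 samples.
modes = [
    [1.0, 0.0, -1.0],
    [0.5, 0.5, 0.5],
]
influences = [[0.0, 0.1] for _ in range(20)] + [[10.0, 0.1], [-10.0, 0.1]]

up, down = tail_genes([row[0] for row in influences])
print("genes in upper tail of mode 0:", up)
print("genes in lower tail of mode 0:", down)

# Keep only mode 0; the contribution of mode 1 is discarded.
compressed = reconstruct(influences, modes, keep=[0])
print("gene 20, mode 0 only:", compressed[20])
```

Each gene's row of influences also serves as its coordinates in the low-dimensional space spanned by the retained modes, which is one way to read the visualization described above.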