Multiple gene expression profile alignment for microarray time-series data clustering
- PMID: 20639411
- DOI: 10.1093/bioinformatics/btq422
Multiple gene expression profile alignment for microarray time-series data clustering
Abstract
Motivation: Clustering gene expression data given in terms of time-series is a challenging problem that imposes its own particular constraints. Traditional clustering methods based on conventional similarity measures are not always suitable for clustering time-series data. A few methods have been proposed recently for clustering microarray time-series, which take the temporal dimension of the data into account. The inherent principle behind these methods is to either define a similarity measure appropriate for temporal expression data, or pre-process the data in such a way that the temporal relationships between and within the time-series are considered during the subsequent clustering phase.
Results: We introduce pairwise gene expression profile alignment, which vertically shifts two profiles in such a way that the area between their corresponding curves is minimal. Based on the pairwise alignment operation, we define a new distance function that is appropriate for time-series profiles. We also introduce a new clustering method that involves multiple expression profile alignment, which generalizes pairwise alignment to a set of profiles. Extensive experiments on well-known datasets yield encouraging results of at least 80% classification accuracy.
Similar articles
-
Techniques for clustering gene expression data.Comput Biol Med. 2008 Mar;38(3):283-93. doi: 10.1016/j.compbiomed.2007.11.001. Epub 2007 Dec 3. Comput Biol Med. 2008. PMID: 18061589 Review.
-
Detecting clusters of different geometrical shapes in microarray gene expression data.Bioinformatics. 2005 May 1;21(9):1927-34. doi: 10.1093/bioinformatics/bti251. Epub 2005 Jan 12. Bioinformatics. 2005. PMID: 15647300
-
A multi-stage approach to clustering and imputation of gene expression profiles.Bioinformatics. 2007 Apr 15;23(8):998-1005. doi: 10.1093/bioinformatics/btm053. Epub 2007 Feb 18. Bioinformatics. 2007. PMID: 17308340
-
Incorporating biological knowledge into distance-based clustering analysis of microarray gene expression data.Bioinformatics. 2006 May 15;22(10):1259-68. doi: 10.1093/bioinformatics/btl065. Epub 2006 Feb 24. Bioinformatics. 2006. PMID: 16500932
-
Comparing algorithms for clustering of expression data: how to assess gene clusters.Methods Mol Biol. 2009;541:479-509. doi: 10.1007/978-1-59745-243-4_21. Methods Mol Biol. 2009. PMID: 19381534 Review.
Cited by
-
Identifying molecular targets for reverse aging using integrated network analysis of transcriptomic and epigenomic changes during aging.Sci Rep. 2021 Jun 10;11(1):12317. doi: 10.1038/s41598-021-91811-1. Sci Rep. 2021. PMID: 34112891 Free PMC article.
-
Detecting qualitative changes in biological systems.Sci Rep. 2020 May 18;10(1):8146. doi: 10.1038/s41598-020-62578-8. Sci Rep. 2020. PMID: 32424123 Free PMC article.
-
Clustering of Bacterial Growth Dynamics in Response to Growth Media by Dynamic Time Warping.Microorganisms. 2020 Feb 26;8(3):331. doi: 10.3390/microorganisms8030331. Microorganisms. 2020. PMID: 32111085 Free PMC article.
-
Selecting the most appropriate time points to profile in high-throughput studies.Elife. 2017 Jan 26;6:e18541. doi: 10.7554/eLife.18541. Elife. 2017. PMID: 28124972 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources