An efficient method for mining cross-timepoint gene regulation sequential patterns from time course gene expression datasets
- PMID: 24267918
- PMCID: PMC3848764
- DOI: 10.1186/1471-2105-14-S12-S3
An efficient method for mining cross-timepoint gene regulation sequential patterns from time course gene expression datasets
Abstract
Background: Observation of gene expression changes implying gene regulations using a repetitive experiment in time course has become more and more important. However, there is no effective method which can handle such kind of data. For instance, in a clinical/biological progression like inflammatory response or cancer formation, a great number of differentially expressed genes at different time points could be identified through a large-scale microarray approach. For each repetitive experiment with different samples, converting the microarray datasets into transactional databases with significant singleton genes at each time point would allow sequential patterns implying gene regulations to be identified. Although traditional sequential pattern mining methods have been successfully proposed and widely used in different interesting topics, like mining customer purchasing sequences from a transactional database, to our knowledge, the methods are not suitable for such biological dataset because every transaction in the converted database may contain too many items/genes.
Results: In this paper, we propose a new algorithm called CTGR-Span (Cross-Timepoint Gene Regulation Sequential pattern) to efficiently mine CTGR-SPs (Cross-Timepoint Gene Regulation Sequential Patterns) even on larger datasets where traditional algorithms are infeasible. The CTGR-Span includes several biologically designed parameters based on the characteristics of gene regulation. We perform an optimal parameter tuning process using a GO enrichment analysis to yield CTGR-SPs more meaningful biologically. The proposed method was evaluated with two publicly available human time course microarray datasets and it was shown that it outperformed the traditional methods in terms of execution efficiency. After evaluating with previous literature, the resulting patterns also strongly correlated with the experimental backgrounds of the datasets used in this study.
Conclusions: We propose an efficient CTGR-Span to mine several biologically meaningful CTGR-SPs. We postulate that the biologist can benefit from our new algorithm since the patterns implying gene regulations could provide further insights into the mechanisms of novel gene regulations during a biological or clinical progression. The Java source code, program tutorial and other related materials used in this program are available at http://websystem.csie.ncku.edu.tw/CTGR-Span.rar.
Figures
Similar articles
-
MiningABs: mining associated biomarkers across multi-connected gene expression datasets.BMC Bioinformatics. 2014 Jun 8;15:173. doi: 10.1186/1471-2105-15-173. BMC Bioinformatics. 2014. PMID: 24909518 Free PMC article.
-
Mining significant high utility gene regulation sequential patterns.BMC Syst Biol. 2017 Dec 14;11(Suppl 6):109. doi: 10.1186/s12918-017-0475-4. BMC Syst Biol. 2017. PMID: 29297335 Free PMC article.
-
Mining differential top-k co-expression patterns from time course comparative gene expression datasets.BMC Bioinformatics. 2013 Jul 21;14:230. doi: 10.1186/1471-2105-14-230. BMC Bioinformatics. 2013. PMID: 23870110 Free PMC article.
-
Efficient mining differential co-expression biclusters in microarray datasets.Gene. 2013 Apr 10;518(1):59-69. doi: 10.1016/j.gene.2012.11.085. Epub 2012 Dec 28. Gene. 2013. PMID: 23276708
-
Extracting regulatory modules from gene expression data by sequential pattern mining.BMC Genomics. 2011 Nov 30;12 Suppl 3(Suppl 3):S5. doi: 10.1186/1471-2164-12-S3-S5. Epub 2011 Nov 30. BMC Genomics. 2011. PMID: 22369275 Free PMC article.
Cited by
-
MiningABs: mining associated biomarkers across multi-connected gene expression datasets.BMC Bioinformatics. 2014 Jun 8;15:173. doi: 10.1186/1471-2105-15-173. BMC Bioinformatics. 2014. PMID: 24909518 Free PMC article.
-
Mining significant high utility gene regulation sequential patterns.BMC Syst Biol. 2017 Dec 14;11(Suppl 6):109. doi: 10.1186/s12918-017-0475-4. BMC Syst Biol. 2017. PMID: 29297335 Free PMC article.
References
-
- Shie B-E, Cheng J-H, Chuang K-T, Tseng V. In: A One-Phase Method for Mining High Utility Mobile Sequential Patterns in Mobile Commerce Environments Advanced Research in Applied Artificial Intelligence. Jiang H, Ding W, Ali M, Wu X, editor. Vol. 14. Springer Berlin/Heidelberg; 2012. pp. 616–626.
-
- Cheng CP, Lee PF, Chin CY, Liu WC, Wu IC, Chang TT, Tseng VS. Discovering Clinical Biomarkers of Chronic Hepatitis B by Mining Mutation Hotspots. Technologies and Applications of Artificial Intelligence (TAAI), 2011 International Conference on: 11-13 Nov. 2011. 2011. pp. 51–56.
-
- McDunn JE, Husain KD, Polpitiya AD, Burykin A, Ruan J, Li Q, Schierding W, Lin N, Dixon D, Zhang W. et al.Plasticity of the systemic inflammatory response to acute infection during critical illness: development of the riboleukogram. PloS one. 2008;14(2):e1564. doi: 10.1371/journal.pone.0001564. - DOI - PMC - PubMed
-
- Taylor MW, Tsukahara T, McClintick JN, Edenberg HJ, Kwo P. Cyclic changes in gene expression induced by Peg-interferon alfa-2b plus ribavirin in peripheral blood monocytes (PBMC) of hepatitis C patients during the first 10 weeks of treatment. Journal of translational medicine. 2008;14:66. doi: 10.1186/1479-5876-6-66. - DOI - PMC - PubMed
-
- Yoshizuka N, Yoshizuka-Chadani Y, Krishnan V, Zeichner SL. Human immunodeficiency virus type 1 Vpr-dependent cell cycle arrest through a mitogen-activated protein kinase signal transduction pathway. Journal of virology. 2005;14(17):11366–11381. doi: 10.1128/JVI.79.17.11366-11381.2005. - DOI - PMC - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases