Discretization of time series data
- PMID: 20583929
- PMCID: PMC3203514
- DOI: 10.1089/cmb.2008.0023
Discretization of time series data
Abstract
An increasing number of algorithms for biochemical network inference from experimental data require discrete data as input. For example, dynamic Bayesian network methods and methods that use the framework of finite dynamical systems, such as Boolean networks, all take discrete input. Experimental data, however, are typically continuous and represented by computer floating point numbers. The translation from continuous to discrete data is crucial in preserving the variable dependencies and thus has a significant impact on the performance of the network inference algorithms. We compare the performance of two such algorithms that use discrete data using several different discretization algorithms. One of the inference methods uses a dynamic Bayesian network framework, the other-a time-and state-discrete dynamical system framework. The discretization algorithms are quantile, interval discretization, and a new algorithm introduced in this article, SSD. SSD is especially designed for short time series data and is capable of determining the optimal number of discretization states. The experiments show that both inference methods perform better with SSD than with the other methods. In addition, SSD is demonstrated to preserve the dynamic features of the time series, as well as to be robust to noise in the experimental data. A C++ implementation of SSD is available from the authors at http://polymath.vbi.vt.edu/discretization .
Figures






Similar articles
-
Reconstructing gene-regulatory networks from time series, knock-out data, and prior knowledge.BMC Syst Biol. 2007 Feb 2;1:11. doi: 10.1186/1752-0509-1-11. BMC Syst Biol. 2007. PMID: 17408501 Free PMC article.
-
An algebra-based method for inferring gene regulatory networks.BMC Syst Biol. 2014 Mar 26;8:37. doi: 10.1186/1752-0509-8-37. BMC Syst Biol. 2014. PMID: 24669835 Free PMC article.
-
Benchmarking time-series data discretization on inference methods.Bioinformatics. 2019 Sep 1;35(17):3102-3109. doi: 10.1093/bioinformatics/btz036. Bioinformatics. 2019. PMID: 30657860
-
A review on the computational approaches for gene regulatory network construction.Comput Biol Med. 2014 May;48:55-65. doi: 10.1016/j.compbiomed.2014.02.011. Epub 2014 Feb 24. Comput Biol Med. 2014. PMID: 24637147 Review.
-
Inference of dynamic networks using time-course data.Brief Bioinform. 2014 Mar;15(2):212-28. doi: 10.1093/bib/bbt028. Epub 2013 May 21. Brief Bioinform. 2014. PMID: 23698724 Review.
Cited by
-
The Association Between the Acute:Chronic Workload Ratio and Running-Related Injuries in Dutch Runners: A Prospective Cohort Study.Sports Med. 2021 Nov;51(11):2437-2447. doi: 10.1007/s40279-021-01483-0. Epub 2021 May 30. Sports Med. 2021. PMID: 34052983
-
Associating expression and genomic data using co-occurrence measures.Biol Direct. 2019 May 9;14(1):10. doi: 10.1186/s13062-019-0240-2. Biol Direct. 2019. PMID: 31072345 Free PMC article.
-
Integrating genomics and proteomics data to predict drug effects using binary linear programming.PLoS One. 2014 Jul 18;9(7):e102798. doi: 10.1371/journal.pone.0102798. eCollection 2014. PLoS One. 2014. PMID: 25036040 Free PMC article.
-
Evaluating Uncertainty in Signaling Networks Using Logical Modeling.Front Physiol. 2018 Oct 9;9:1335. doi: 10.3389/fphys.2018.01335. eCollection 2018. Front Physiol. 2018. PMID: 30364151 Free PMC article.
-
Identification of altered biological processes in heterogeneous RNA-sequencing data by discretization of expression profiles.Nucleic Acids Res. 2020 Feb 28;48(4):1730-1747. doi: 10.1093/nar/gkz1208. Nucleic Acids Res. 2020. PMID: 31889184 Free PMC article.
References
-
- Akutsu T. Miyano S. Inferring qualitative relations in genetic networks and metabolic pathways. Bioinformatics. 2000;16:727–734. - PubMed
-
- Albert R. Barabási A. Topology of evolving networks: local events and universality. Phys. Rev. Lett. 2000;85:5234–5237. - PubMed
-
- Andrec M. Kholodenko B.N. Inference of signaling and gene regulatory networks by steady-state perturbation experiments: structure and accuracy. J. Theor. Biol. 2005;232:427. - PubMed
-
- Beal M.J. Falciani F. A Bayesian approach to reconstructing genetic regulatory networks with hidden factors. Bioinformatics. 2005;21:349–356. - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources