Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Jun;17(6):853-68.
doi: 10.1089/cmb.2008.0023.

Discretization of time series data

Affiliations

Discretization of time series data

Elena S Dimitrova et al. J Comput Biol. 2010 Jun.

Abstract

An increasing number of algorithms for biochemical network inference from experimental data require discrete data as input. For example, dynamic Bayesian network methods and methods that use the framework of finite dynamical systems, such as Boolean networks, all take discrete input. Experimental data, however, are typically continuous and represented by computer floating point numbers. The translation from continuous to discrete data is crucial in preserving the variable dependencies and thus has a significant impact on the performance of the network inference algorithms. We compare the performance of two such algorithms that use discrete data using several different discretization algorithms. One of the inference methods uses a dynamic Bayesian network framework, the other-a time-and state-discrete dynamical system framework. The discretization algorithms are quantile, interval discretization, and a new algorithm introduced in this article, SSD. SSD is especially designed for short time series data and is capable of determining the optimal number of discretization states. The experiments show that both inference methods perform better with SSD than with the other methods. In addition, SSD is demonstrated to preserve the dynamic features of the time series, as well as to be robust to noise in the experimental data. A C++ implementation of SSD is available from the authors at http://polymath.vbi.vt.edu/discretization .

PubMed Disclaimer

Figures

FIG. 1.
FIG. 1.
Dentrogram representing the SLC algorithm applied to the data of Example 3.2. The column on the right gives the corresponding Shannon's entropy increasing at each consecutive level.
FIG. 2.
FIG. 2.
The complete weighted graph constructed from vector entries 1, 2, 7, 9, 10, and 11. Only the edge weights of the outer edges are given.
FIG. 3.
FIG. 3.
(A) Dependency graph of the A-Biochem-generated artificial gene network on five genes Gi. (B) Dependency graph of the reverse-engineered system.
FIG. 4.
FIG. 4.
(A) Wild-type time series generated by solving numerically the ODE system. (B) Corresponding discrete point time series.
FIG. 5.
FIG. 5.
Network 1: 10 genes and three environmental perturbations. In this network, the three environmental perturbations P1, P2, and P3 directly affect the expression rate of genes G1, G2, and G5, respectively.
FIG. 6.
FIG. 6.
Three in silico networks. The first network consists of 10 genes and the second and third networks consist of 5 genes. In these three networks, the interactions between genes are phenomenological and represent the result of the effect of transcription and translation on the regulation of the genes in the network.

Similar articles

Cited by

References

    1. Akutsu T. Miyano S. Inferring qualitative relations in genetic networks and metabolic pathways. Bioinformatics. 2000;16:727–734. - PubMed
    1. Albert R. Barabási A. Topology of evolving networks: local events and universality. Phys. Rev. Lett. 2000;85:5234–5237. - PubMed
    1. Albert R. Othmer H. The topology of the regulatory interactions predict the expression pattern of the segment polarity genes in Drosophila melanogaster. J. Theor. Biol. 2003;223:1–18. - PMC - PubMed
    1. Andrec M. Kholodenko B.N. Inference of signaling and gene regulatory networks by steady-state perturbation experiments: structure and accuracy. J. Theor. Biol. 2005;232:427. - PubMed
    1. Beal M.J. Falciani F. A Bayesian approach to reconstructing genetic regulatory networks with hidden factors. Bioinformatics. 2005;21:349–356. - PubMed

Publication types

LinkOut - more resources