Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2009 Dec 28:10:448.
doi: 10.1186/1471-2105-10-448.

Reconstructing nonlinear dynamic models of gene regulation using stochastic sampling

Affiliations

Reconstructing nonlinear dynamic models of gene regulation using stochastic sampling

Johanna Mazur et al. BMC Bioinformatics. .

Abstract

Background: The reconstruction of gene regulatory networks from time series gene expression data is one of the most difficult problems in systems biology. This is due to several reasons, among them the combinatorial explosion of possible network topologies, limited information content of the experimental data with high levels of noise, and the complexity of gene regulation at the transcriptional, translational and post-translational levels. At the same time, quantitative, dynamic models, ideally with probability distributions over model topologies and parameters, are highly desirable.

Results: We present a novel approach to infer such models from data, based on nonlinear differential equations, which we embed into a stochastic Bayesian framework. We thus address both the stochasticity of experimental data and the need for quantitative dynamic models. Furthermore, the Bayesian framework allows it to easily integrate prior knowledge into the inference process. Using stochastic sampling from the Bayes' posterior distribution, our approach can infer different likely network topologies and model parameters along with their respective probabilities from given data. We evaluate our approach on simulated data and the challenge #3 data from the DREAM 2 initiative. On the simulated data, we study effects of different levels of noise and dataset sizes. Results on real data show that the dynamics and main regulatory interactions are correctly reconstructed.

Conclusions: Our approach combines dynamic modeling using differential equations with a stochastic learning framework, thus bridging the gap between biophysical modeling and stochastic inference approaches. Results show that the method can reap the advantages of both worlds, and allows the reconstruction of biophysically accurate dynamic models from noisy data. In addition, the stochastic learning framework used permits the computation of probability distributions over models and model parameters, which holds interesting prospects for experimental design purposes.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Regulation functions. Hill functions fij for different Hill coefficients m = 1, 3, 5. The left plot shows an activation, the right plot an inhibitory effect. The threshold θj was chosen equal to 3 for both plots, at this concentration of the regulating gene j, half the maximum effect on gene i is achieved.
Figure 2
Figure 2
Lq Prior. Plot of the two-dimensional Lq prior p(β1, β2):= Lq(β1; q, s)·Lq(β2; q, s) for q = 0.5 and s = 1. It can clearly be seen how this prior favors points (β1, β2) where one of the two components is approximately zero over points at the same distance from the origin with both β1, β2 ≠ 0.
Figure 3
Figure 3
Gold Standard Topology and Simulated Data. (a) True network of the DREAM 2 challenge #3 five gene time series data, showing the bio-engineered interactions between the five genes artificially inserted into yeast. (b) Time course of simulation with model in arbitrary time and concentration units, for the simulated five gene model. Different numbers of equidistant time points from this data were used for network reconstruction in the simulation study. The time courses of gene 2 and gene 3 are almost the same.
Figure 4
Figure 4
AUC Values for Simulated Data. AUC values for different noise levels and different numbers of time points used for network reconstruction. The standard deviation of the noise was varied from σ = 0 to σ = 0.3, the number of time points from T = 10 to T = 200. The plots show (a) AUC values under the ROC curve and (b) AUC values for PR curves for varying T and σ. The blue surface indicates the AUCROC and AUCPR values that would follow for random guessing.
Figure 5
Figure 5
Learned Dynamics of DREAM 2 Challenge Data. Plot of the experimental data from the DREAM 2 challenge, in comparison to time courses simulated with reconstructed model parameters. Shown in black is the smoothed data.

Similar articles

Cited by

References

    1. de Jong H. Modeling and Simulation of Genetic Regulatory Systems: A Literature Review. Journal of Computational Biology. 2002;9:67–103. doi: 10.1089/10665270252833208. - DOI - PubMed
    1. Garner TS, Faith JJ. Reverse-engineering transcription control networks. Physics of Life Reviews. 2005;2:65–88. doi: 10.1016/j.plrev.2005.01.001. - DOI - PubMed
    1. Filkov V. In: Handbook of Computational Molecular Biology. Aluru S, editor. Boca Raton, FL, USA: Chapman & Hall/CRC Press; 2005. Identifying Gene Regulatory Networks from Gene Expression Data; pp. 27.1–27.29.
    1. Goutsias J, Lee NH. Computational and Experimental Approaches for Modeling Gene Regulatory Networks. Current Pharmaceutical Design. 2007;13:1415–1436. doi: 10.2174/138161207780765945. - DOI - PubMed
    1. Bansal M, Belcastro V, Ambesi-Impiombato A, di Bernardo D. How to infer gene networks from expression profiles. Molecular Systems Biology. 2007;3(78) - PMC - PubMed

Publication types

MeSH terms

LinkOut - more resources