. 2012 Apr 15;28(8):1136-42.

doi: 10.1093/bioinformatics/bts092. Epub 2012 Feb 24.

A Bayesian approach to targeted experiment design

J Vanlier¹, C A Tiemann, P A J Hilbers, N A W van Riel

Affiliations

PMID: 22368245
PMCID: PMC3324513
DOI: 10.1093/bioinformatics/bts092

A Bayesian approach to targeted experiment design

J Vanlier et al. Bioinformatics. 2012.

. 2012 Apr 15;28(8):1136-42.

doi: 10.1093/bioinformatics/bts092. Epub 2012 Feb 24.

Authors

J Vanlier¹, C A Tiemann, P A J Hilbers, N A W van Riel

Affiliation

¹ Department of BioMedical Engineering, Eindhoven University of Technology, Eindhoven, The Netherlands. j.vanlier@tue.nl

PMID: 22368245
PMCID: PMC3324513
DOI: 10.1093/bioinformatics/bts092

Abstract

Motivation: Systems biology employs mathematical modelling to further our understanding of biochemical pathways. Since the amount of experimental data on which the models are parameterized is often limited, these models exhibit large uncertainty in both parameters and predictions. Statistical methods can be used to select experiments that will reduce such uncertainty in an optimal manner. However, existing methods for optimal experiment design (OED) rely on assumptions that are inappropriate when data are scarce considering model complexity.

Results: We have developed a novel method to perform OED for models that cope with large parameter uncertainty. We employ a Bayesian approach involving importance sampling of the posterior predictive distribution to predict the efficacy of a new measurement at reducing the uncertainty of a selected prediction. We demonstrate the method by applying it to a case where we show that specific combinations of experiments result in more precise predictions.

Availability and implementation: Source code is available at: http://bmi.bmt.tue.nl/sysbio/software/pua.html.

PubMed Disclaimer

Figures

**Fig. 1.**
Illustration of the effect of adding a new data point on the PPD. Shown on the top right is the PPD at one specific time point for two predictions with a subset of the samples of the chain indicated with white points. The square denotes the location of the ‘new measurement’. Prediction A refers to a prediction of which a new measurement can be performed (observable), whereas B denotes the prediction of interest. Here the grey distribution corresponds to the PPD before the new measurement, whereas the white Gaussian corresponds to the error model of the new measurement. Due to additional constraints imposed by this new measurement in combination with the old data and the model, the distribution on the hypothesis side is also updated in light of the new data point and shown in white.

**Fig. 2.**
Model of the JAK-STAT pathway. In this model u₁ serves as driving input, while the total concentration of STAT (x₁+x₂+2x₃) and the total concentration of phosphorylated STAT in the cytoplasm (x₂+2x₃) were measured. Note that the step from x₄ back to x₁ is associated with a delay.

**Fig. 3.**
Top left: one simulated time course of state 3 superimposed on the PPD. Two time points are indicated with circles. Bottom left: correlation coefficient between states 3 and 4 and SVR of state 4 based on a measurement of state 3 (SVR). The relation between the two states at the indicated time points is shown in both scatter plot and 2D histogram form. The former shows the actual samples from the PPD for one point in time. Here the dots represent simulated values belonging to different parameter sets from the MCMC chain. In the histogram the colour indicates the number of samples in a particular region which is proportional to the probability density.

**Fig. 4.**
Variance reduction of the peak time of dimerized STAT (x₄) with respect to two new measurements. (A) Each axis represents an experiment, where the different model outputs are numbered. Numbers 1 to 3 correspond to the first three states whereas 4 and 5 correspond to the sums of states on which the original PPD was parametrized. Note that each block on each axis corresponds to an entire time series. The block corresponding to experiments involving state 1 is shown enlarged in (B). Variance reduction is computed using the importance sampling method.

**Fig. 5.**
Comparison of two methods for calculating the variance reduction. Variance reduction of the peak time of dimerized STAT (x₄) with respect to two new measurements. (A) LVR. (B) Difference between the variance reduction computed by means of LVR and importance sampling (shown in Fig. 4).

See this image and copyright information in PMC

Cited by

Standing Variations Modeling Captures Inter-Individual Heterogeneity in a Deterministic Model of Prostate Cancer Response to Combination Therapy.
Jain HV, Sorribes IC, Handelman SK, Barnaby J, Jackson TL. Jain HV, et al. Cancers (Basel). 2021 Apr 14;13(8):1872. doi: 10.3390/cancers13081872. Cancers (Basel). 2021. PMID: 33919753 Free PMC article.
Iterative experiment design guides the characterization of a light-inducible gene expression circuit.
Ruess J, Parise F, Milias-Argeitis A, Khammash M, Lygeros J. Ruess J, et al. Proc Natl Acad Sci U S A. 2015 Jun 30;112(26):8148-53. doi: 10.1073/pnas.1423947112. Epub 2015 Jun 17. Proc Natl Acad Sci U S A. 2015. PMID: 26085136 Free PMC article.
PEITH(Θ): perfecting experiments with information theory in Python with GPU support.
Dony L, Mackerodt J, Ward S, Filippi S, Stumpf MPH, Liepe J. Dony L, et al. Bioinformatics. 2018 Apr 1;34(7):1249-1250. doi: 10.1093/bioinformatics/btx776. Bioinformatics. 2018. PMID: 29228182 Free PMC article.
Quantifying the relative importance of experimental data points in parameter estimation.
Jeong JE, Qiu P. Jeong JE, et al. BMC Syst Biol. 2018 Nov 22;12(Suppl 6):103. doi: 10.1186/s12918-018-0622-6. BMC Syst Biol. 2018. PMID: 30463558 Free PMC article.
Optimal experiment design for model selection in biochemical networks.
Vanlier J, Tiemann CA, Hilbers PA, van Riel NA. Vanlier J, et al. BMC Syst Biol. 2014 Feb 20;8:20. doi: 10.1186/1752-0509-8-20. BMC Syst Biol. 2014. PMID: 24555498 Free PMC article.

See all "Cited by" articles

References

1. Brännmark C., et al. Mass and information feedbacks through receptor endocytosis govern insulin signaling as revealed using a parameter-free modeling framework. J. Biol. Chem. 2010;285:20171. - PMC - PubMed
1. Brown K.S., Sethna J.P. Statistical mechanical approaches to models with many poorly known parameters. Phys. Rev. E. 2003;68:021904. - PubMed
1. Calderhead B., Girolami M. Statistical analysis of nonlinear dynamical systems using differential geometric sampling methods. J. R. Soc. Interface Focus. 2011;1:821–835. - PMC - PubMed
1. Casey F., et al. Optimal experimental design in an epidermal growth factor receptor signalling and down-regulation model. Syst. Biol. IET. 2007;1:190–202. - PubMed
1. Cedersund G., Roll J. Systems biology: model based evaluation and comparison of potential explanations for given biological data. FEBS J. 2009;276:903–922. - PubMed

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Substances

Actions
Actions

LinkOut - more resources

Full Text Sources
Other Literature Sources
- The Lens - Patent Citations Database

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

A Bayesian approach to targeted experiment design

Affiliation

A Bayesian approach to targeted experiment design

Authors

Affiliation

Abstract

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Substances

LinkOut - more resources

Full Text Sources

Other Literature Sources