Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2017 Sep;14(134):20170340.
doi: 10.1098/rsif.2017.0340.

Comparing two sequential Monte Carlo samplers for exact and approximate Bayesian inference on biological models

Affiliations
Comparative Study

Comparing two sequential Monte Carlo samplers for exact and approximate Bayesian inference on biological models

Aidan C Daly et al. J R Soc Interface. 2017 Sep.

Abstract

Bayesian methods are advantageous for biological modelling studies due to their ability to quantify and characterize posterior variability in model parameters. When Bayesian methods cannot be applied, due either to non-determinism in the model or limitations on system observability, approximate Bayesian computation (ABC) methods can be used to similar effect, despite producing inflated estimates of the true posterior variance. Owing to generally differing application domains, there are few studies comparing Bayesian and ABC methods, and thus there is little understanding of the properties and magnitude of this uncertainty inflation. To address this problem, we present two popular strategies for ABC sampling that we have adapted to perform exact Bayesian inference, and compare them on several model problems. We find that one sampler was impractical for exact inference due to its sensitivity to a key normalizing constant, and additionally highlight sensitivities of both samplers to various algorithmic parameters and model conditions. We conclude with a study of the O'Hara-Rudy cardiac action potential model to quantify the uncertainty amplification resulting from employing ABC using a set of clinically relevant biomarkers. We hope that this work serves to guide the implementation and comparative assessment of Bayesian and ABC sampling techniques in biological models.

Keywords: approximate Bayesian computation; cardiac modelling; identifiability; sequential Monte Carlo; summary statistics.

PubMed Disclaimer

Conflict of interest statement

We declare we have no competing interests.

Figures

Figure 1.
Figure 1.
Impact of the magnitude of the normalizing constant cɛt of equation (2.4) on posteriors produced by a rejection sampler. The plot (a) demonstrates a sample (Gaussian) acceptance kernel function, with three potential values for the normalizing constant shown relative to an empirical estimate of the mode formula image. Plots (b) show the true posterior (blue) along with empirical posteriors that would result from employing a sampler with an acceptance kernel normalized to a value greater than (i), slightly less than (ii) or much less than (iii) its true modal value.
Figure 2.
Figure 2.
Schematic of the Toni SMC sampler. ‘Weighting’ encompasses lines 20–22 of algorithm 2, where importance weights are assigned to each particle to allow for sampling the next estimate. ‘Rejection sampling’ encompasses lines 7–14, where samples are drawn from the previous estimate, perturbed according to the proposal distribution, then accepted or rejected (and subsequently resampled) according to the kernel function. (Online version in colour.)
Figure 3.
Figure 3.
Schematic of the Del Moral SMC sampler. Between rounds, particles with non-zero weights evolve according to independent Markov Chains (lines 22–25, algorithm 2) and weight calculation (lines 7–9) can happen concurrently. If the ESS falls below a critical fraction, ‘resampling’ occurs to repopulate the posterior estimate with uniformly weighted draws from the previous estimate (lines 16–21). (Online version in colour.)
Figure 4.
Figure 4.
Visual representation of the summary statistics used by the approximate samplers for the O'Hara–Rudy action potential model. Vertical lines represent the times (ms) used for the values of APD50 and APD90 (respectively, labelled), while horizontal lines represent the voltages (mV) used for the values of peak and rest potential (respectively, labelled). The red ‘x’ indicates the point at which dV/dt (mV ms−1) is greatest, i.e. the point of maximum upstroke velocity. (Online version in colour.)
Figure 5.
Figure 5.
Posteriors generated by the (a) Toni and (b) Del Moral samplers over the linear model when using a top-hat acceptance kernel. Vertical bars represent frequencies of binned values of a among particles in the posterior estimate, while the vertical red line represents the theoretical posterior mean calculated with equation (3.2). (Online version in colour.)
Figure 6.
Figure 6.
Effect of population size and α in the Del Moral sampler on posteriors produced over the linear model. Posteriors produced using 1000 (a) and 10 000 (b) particles at varying values of the algorithmic parameter α. Histograms parallel to the ‘a’ axis represent the frequencies of particles in the final posterior estimate generated under the corresponding value of α on the ‘alpha’ axis. Note that the a-axis has been shifted in both graphs, as noted in the bottom-left marking on each. (Online version in colour.)
Figure 7.
Figure 7.
Posterior generated by the Del Moral sampler over the linear model when using a Gaussian acceptance kernel. (a) Histogram of particle frequencies, with the red vertical line indicating the theoretical mean. (b) Quantile–quantile plot comparing the PPF of the estimated posterior (x-axis) to that of the theoretical posterior (y-axis) by paired estimation at 100 equally spaced quantiles. (Online version in colour.)
Figure 8.
Figure 8.
Effect of cɛt on posteriors generated by the Toni algorithm over the linear model with a Gaussian acceptance kernel. formula image formula image, where formula image is the maximum-likelihood estimate of the slope parameter. (a) Histogram of particle frequencies, with the red vertical line indicating the theoretical mean. (b) Quantile–quantile plot comparing the PPF of the estimated posterior (x-axis) to that of the theoretical posterior (y-axis) at 100 equally spaced quantiles. (Online version in colour.)
Figure 9.
Figure 9.
Effect of model dimension on stability of Toni and Del Moral samplers. Variance of particle weights (σw, plotted in log-scale) in the approximate Toni sampler (a) and both the approximate (b) and exact (c) Del Moral samplers, each with 1000 particles, as they iterate over posterior estimates of the polynomial model with increasing numbers of parameters (indicated by their ‘order’). Iteration of the algorithm is marked on the x-axis (log-scale) by the value of the algorithmic threshold parameter (ɛt).
Figure 10.
Figure 10.
Selected marginal distributions from the posterior produced by the exact Del Moral algorithm on the 13-parameter O'Hara–Rudy model. Inference was performed using full time trace data. Generating values for the simulated data are indicated by vertical red lines. (Online version in colour.)
Figure 11.
Figure 11.
Selected marginal distributions from the posterior produced by the approximate Del Moral algorithm on the 13-parameter O'Hara–Rudy model. Inference was performed using summary statistic data. Generating values for the simulated data are indicated by vertical red lines. (Online version in colour.)

Similar articles

Cited by

References

    1. Wilkinson DJ. 2011. Stochastic modelling for systems biology, 2nd edn Mathematical and Computational Biology London, UK: CRC.
    1. Smith N, Niederer S. 2016. Is computational modeling adding value for understanding the heart? J. Mol. Cell. Cardiol. 96, 1 (10.1016/j.yjmcc.2016.04.017) - DOI - PubMed
    1. Louch WE, Land S, Niederer S, Alexander S. 2014. Strange bedfellows: biologists and mathematical modelers tie the knot on cardiomyocyte calcium homeostasis. Drug Discov. Today 14, 11–16. (10.1016/j.ddmod.2014.05.001) - DOI
    1. Niederer S, Land S, Omholt SW, Smith NP. 2012. Interpreting genetic effects through models of cardiac electromechanics. Am. J. Physiol. 303, H1294–H1303. (10.1152/ajpheart.00121.2012) - DOI - PMC - PubMed
    1. Noble D, Rudy Y. 2001. Models of cardiac ventricular action potentials: iterative interactions between experiment and simulation. Phil. Trans. R. Soc. Lond. A 359, 1127–1142. (10.1098/rsta.2001.0820) - DOI

Publication types

LinkOut - more resources