Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Aug 14;18(8):e0289889.
doi: 10.1371/journal.pone.0289889. eCollection 2023.

Application of referenced thermodynamic integration to Bayesian model selection

Affiliations

Application of referenced thermodynamic integration to Bayesian model selection

Iwona Hawryluk et al. PLoS One. .

Abstract

Evaluating normalising constants is important across a range of topics in statistical learning, notably Bayesian model selection. However, in many realistic problems this involves the integration of analytically intractable, high-dimensional distributions, and therefore requires the use of stochastic methods such as thermodynamic integration (TI). In this paper we apply a simple but under-appreciated variation of the TI method, here referred to as referenced TI, which computes a single model's normalising constant in an efficient way by using a judiciously chosen reference density. The advantages of the approach and theoretical considerations are set out, along with pedagogical 1 and 2D examples. The approach is shown to be useful in practice when applied to a real problem -to perform model selection for a semi-mechanistic hierarchical Bayesian model of COVID-19 transmission in South Korea involving the integration of a 200D density.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. Illustration of steps from the 1D pedagogical example.
A) qλqref(1-λ) for the 1d example density in parameter θ (Eq 9) at selected λ values along the path. B) Expectation Eq(λ;θ)[logq(θ)qref(θ)] vs MCMC iteration, shown at each value of λ sampled. C) λ-dependence of Eq(λ;θ)[logq(θ)qref(θ)], the TI contribution to the log-evidence. D) Convergence of the evidence z, with 1% convergence after 500 iterations and 0.1% after 17, 000 iterations per λ.
Fig 2
Fig 2. Contour plots of un-normalised densities.
Contour plots of the un-normalised density q and its two reference densities qref, one using a full covariance matrix and another using a diagonal covariance matrix that can be easily marginalised. The red line shows the lower boundary θ1 = 0 and the shaded θ1 < 0 region to the left of the line is outside of the support of the density q.
Fig 3
Fig 3. The estimated log-evidence of M2 from the Radiata Pine benchmark problem.
The log-evidence of M2 from the Radiata Pine benchmark problem is shown estimated using three approaches. (A) shows the rolling mean of log-evidence of M2 over 1500 iterations per λ obtained by referenced TI (blue line) and PP100 (orange line) methods. The exact value is shown with red dashed line. (B) shows the mean log-evidence of the model M2 evaluated over 15 runs of the three algorithms. The exact value of the log-evidence is shown with the dotted line.
Fig 4
Fig 4. Time-dependent reproduction number.
Time-dependent reproduction number generated by models with the highest evidence calculated using the Laplace approximation (orange lines) and referenced TI (blue lines). Note, the fitting data in this example contains superspreading events (which leads to very high values of Rt on certain days) so is not representative of SARS-CoV-2 transmission generally.

References

    1. Kass RE, Raftery AE. Bayes Factors. Journal of the American Statistical Association. 1995;90(430):773–795. doi: 10.1080/01621459.1995.10476572 - DOI
    1. Bennett CH. Efficient estimation of free energy differences from Monte Carlo data. Journal of Computational Physics. 1976;22(2):245–268. doi: 10.1016/0021-9991(76)90078-4 - DOI
    1. Meng XL, Wong WH. Simulating rations of normalising constants via a simple identity: a theoretical exploration. Statistica Sinica. 1996;6(4):831–860.
    1. Skilling J. Nested Sampling for General Bayesian Computation. Bayesian Analysis. 2006;1(4):833–860. doi: 10.1214/06-BA127 - DOI
    1. Habeck M. Evaluation of marginal likelihoods via the density of states. In: Proceedings of the Fifteenth International Conference on Artificial Intelligence and Statistics. vol. 22 of Proceedings of Machine Learning Research. La Palma, Canary Islands: PMLR; 2012. p. 486–494. Available from: https://proceedings.mlr.press/v22/habeck12.html.

Publication types