Learning unbelievable probabilities

Xaq Pitkow et al. Adv Neural Inf Process Syst. 2011 Dec;24:738-746.

Abstract

Loopy belief propagation performs approximate inference on graphical models with loops. One might hope to compensate for the approximation by adjusting model parameters. Learning algorithms for this purpose have been explored previously, and the claim has been made that every set of locally consistent marginals can arise from belief propagation run on a graphical model. On the contrary, here we show that many probability distributions have marginals that cannot be reached by belief propagation using any set of model parameters or any learning algorithm. We call such marginals 'unbelievable.' This problem occurs whenever the Hessian of the Bethe free energy is not positive-definite at the target marginals. All learning algorithms for belief propagation necessarily fail in these cases, producing beliefs or sets of beliefs that may even be worse than the pre-learning approximation. We then show that averaging inaccurate beliefs, each obtained from belief propagation using model parameters perturbed about some learned mean values, can achieve the unbelievable marginals.
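To make the setting concrete, here is a minimal loopy belief propagation routine for the binary pairwise models the paper studies, using the standard tanh message parameterization. This is an illustrative sketch, not the authors' code; the function name and defaults are mine.

```python
import numpy as np

def loopy_bp(h, J, iters=500, damp=0.5):
    """Loopy belief propagation for a binary (+/-1) pairwise model
    p(x) proportional to exp(sum_i h[i]*x[i] + sum_{i<j} J[i,j]*x[i]*x[j]).

    J must be symmetric with zero diagonal. m[i, j] holds the cavity-field
    message from node i to node j. Returns the beliefs P(x_i = +1).
    """
    n = len(h)
    m = np.zeros((n, n))
    for _ in range(iters):
        incoming = m.sum(axis=0)                    # sum_k m[k, i] for each i
        cav = h[:, None] + incoming[:, None] - m.T  # field at i, excluding j
        new_m = np.arctanh(np.tanh(J) * np.tanh(cav))
        np.fill_diagonal(new_m, 0.0)
        m = damp * m + (1 - damp) * new_m           # damping aids convergence
    return 0.5 * (1.0 + np.tanh(h + m.sum(axis=0)))  # P(x_i = +1)
```

On a tree this recursion is exact; on graphs with loops it returns the BP pseudomarginals whose reachability the paper analyzes.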


Figures

Figure 1
Landscape of Bethe free energy for the binary graphical model with pairwise interactions. (A) A slice through the Bethe free energy (solid lines) along one axis v1 of pseudomarginal space, for three different values of parameters θ. The energy U is linear in the pseudomarginals (dotted lines), so varying the parameters only changes the tilt of the free energy. This can add or remove local minima. (B) The second derivatives of the free energies in (A) are all identical. Where the second derivative is positive, a local minimum can exist (cyan); where it is negative (yellow), no parameters can produce a local minimum. (C) A two-dimensional slice of the Bethe free energy, colored according to the minimum eigenvalue λmin of the Bethe Hessian. During a run of Bethe wake-sleep learning, the beliefs (blue dots) proceed along v2 toward the target marginals p. Stable fixed points of BP can exist only in the believable region (cyan), but the target p resides in an unbelievable region (yellow). As learning equilibrates, the fixed points jump between believable regions on either side of the unbelievable zone.
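The believable/unbelievable distinction in the caption can be checked numerically. Because the energy U is linear in the pseudomarginals, the Hessian of the Bethe free energy equals the Hessian of the negative Bethe entropy and is independent of θ, so a finite-difference Hessian of the entropy term alone classifies a pseudomarginal. A sketch under that observation; the function name and parameterization (singleton probabilities plus one pairwise joint per edge) are mine:

```python
import numpy as np

def bethe_min_eig(p, q, edges, eps=1e-4):
    """Minimum eigenvalue of the Bethe-free-energy Hessian at a pseudomarginal,
    via central finite differences of the negative Bethe entropy.

    A negative value indicates an 'unbelievable' pseudomarginal: the Hessian
    is not positive-definite there, so no parameters give BP a stable fixed
    point at that point.

    p     : singleton probabilities P(x_i = 1), length n
    q     : pairwise probabilities P(x_i = 1, x_j = 1), one per edge
    edges : list of (i, j) index pairs
    """
    n, e = len(p), len(edges)
    deg = np.zeros(n)
    for i, j in edges:
        deg[i] += 1
        deg[j] += 1

    def ent(t):  # Shannon entropy of a probability vector, clipped for safety
        t = np.clip(t, 1e-12, 1.0)
        return -np.sum(t * np.log(t))

    def free(x):  # negative Bethe entropy (free energy minus its linear part)
        pv, qv = x[:n], x[n:]
        s = -sum((deg[i] - 1.0) * ent(np.array([pv[i], 1 - pv[i]]))
                 for i in range(n))
        for k, (i, j) in enumerate(edges):
            s += ent(np.array([qv[k], pv[i] - qv[k], pv[j] - qv[k],
                               1 - pv[i] - pv[j] + qv[k]]))
        return -s

    x0 = np.concatenate([p, q])
    d = n + e
    H = np.zeros((d, d))
    I = np.eye(d)
    for a in range(d):
        for b in range(d):
            f = lambda da, db: free(x0 + eps * (da * I[a] + db * I[b]))
            H[a, b] = (f(1, 1) - f(1, -1) - f(-1, 1) + f(-1, -1)) / (4 * eps**2)
    return np.linalg.eigvalsh(0.5 * (H + H.T)).min()
```

On a tree the Bethe free energy is convex over the local polytope, so the minimum eigenvalue is positive there; negative eigenvalues can appear only on loopy graphs.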
Figure 2
Averaging over variable couplings can produce marginals otherwise unreachable by belief propagation. (A) When attempting to learn unbelievable marginals, the Bethe wake-sleep algorithm drives the parameters θ to a discrete limit cycle. (B) The same limit cycle, projected onto the first two principal components u1 and u2 of θ over the cycle. (C) The corresponding beliefs b during the limit cycle (blue circles), projected onto the first two principal components v1 and v2 of the trajectory through pseudomarginal space. Believable regions of pseudomarginal space are colored cyan, unbelievable regions yellow, and inconsistent pseudomarginals black. Over the limit cycle, the average beliefs (blue ×) equal the target marginals p (black □) exactly. The average (red +) over many BP fixed points (red dots) generated from randomly perturbed parameters θ̄ + δθ still approximates the target marginals better than any individual believable fixed point. (D) Even the best of several BP fixed points cannot match unbelievable marginals (black and grey). Ensemble BP performs much better (red and pink).
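The averaging scheme in panels (C, D) — running BP at many randomly perturbed parameter settings and averaging the resulting beliefs — can be sketched as below. The BP subroutine, perturbation scale, and sample count are illustrative choices, not the paper's exact procedure:

```python
import numpy as np

def bp_marginals(h, J, iters=500, damp=0.5):
    """Standard loopy BP for a binary (+/-1) pairwise model; returns P(x_i = +1)."""
    n = len(h)
    m = np.zeros((n, n))
    for _ in range(iters):
        cav = h[:, None] + m.sum(axis=0)[:, None] - m.T
        new_m = np.arctanh(np.tanh(J) * np.tanh(cav))
        np.fill_diagonal(new_m, 0.0)
        m = damp * m + (1 - damp) * new_m
    return 0.5 * (1.0 + np.tanh(h + m.sum(axis=0)))

def ensemble_bp(h_bar, J_bar, n_samples=200, scale=0.1, rng=None):
    """Average BP beliefs over gaussian perturbations of theta = (h, J).

    Each sample runs BP at (h_bar + dh, J_bar + dJ); the returned belief is
    the mean over samples, which can reach marginals that no single BP fixed
    point attains (the paper's unbelievable marginals).
    """
    rng = np.random.default_rng(rng)
    n = len(h_bar)
    beliefs = []
    for _ in range(n_samples):
        dh = scale * rng.standard_normal(n)
        dJ = np.triu(scale * rng.standard_normal((n, n)), 1)
        dJ = dJ + dJ.T                     # keep the coupling matrix symmetric
        beliefs.append(bp_marginals(h_bar + dh, J_bar + dJ))
    return np.mean(beliefs, axis=0)
```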
Figure 3
Performance in learning unbelievable marginals. (A) Fraction of marginals that are unbelievable. Marginals were generated from fully connected, 8-node binary models with random biases and pairwise couplings, hi ~ 𝒩(0, 1/3) and Jij ~ 𝒩(0, σJ). (B,C) Performance of five models on 370 unbelievable random target marginals (Section 3), measured with the Bethe divergence Dβ[p||b] (B) and the Euclidean distance |p − b| (C). Targets were generated as in (A) with σJ = 1/3 and selected for unbelievability. Bars represent central quartiles, and the white line indicates the median. The five models are: (i) BP on the graphical model that generated the target distribution, (ii) BP after parameters are set by pseudomoment matching, (iii) the best-performing beliefs encountered during Bethe wake-sleep learning, (iv) eBP using the exact parameters from the last 100 iterations of learning, and (v) eBP with gaussian-distributed parameters matching the first- and second-order statistics of (iv).

