Model-based machine learning

Christopher M Bishop. Philos Trans A Math Phys Eng Sci. 2012 Dec 31;371(1984):20120222. doi: 10.1098/rsta.2012.0222. Print 2013 Feb 13.

Author

Christopher M Bishop¹

Affiliation

¹ Microsoft Research, Cambridge CB3 0FB, UK. christopher.bishop@microsoft.com

PMID: 23277612
PMCID: PMC3538442
DOI: 10.1098/rsta.2012.0222

Abstract

Several decades of research in the field of machine learning have resulted in a multitude of different algorithms for solving a broad range of problems. To tackle a new application, a researcher typically tries to map their problem onto one of these existing methods, often influenced by their familiarity with specific algorithms and by the availability of corresponding software implementations. In this study, we describe an alternative methodology for applying machine learning, in which a bespoke solution is formulated for each new application. The solution is expressed through a compact modelling language, and the corresponding custom machine learning code is then generated automatically. This model-based approach offers several major advantages, including the opportunity to create highly tailored models for specific scenarios, as well as rapid prototyping and comparison of a range of alternative models. Furthermore, newcomers to the field of machine learning do not have to learn about the huge range of traditional methods, but instead can focus their attention on understanding a single modelling environment. In this study, we show how probabilistic graphical models, coupled with efficient inference algorithms, provide a very flexible foundation for model-based machine learning, and we outline a large-scale commercial application of this framework involving tens of millions of users. We also describe the concept of probabilistic programming as a powerful software environment for model-based machine learning, and we discuss a specific probabilistic programming language called Infer.NET, which has been widely used in practical applications.
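As a minimal illustration of the model-based idea, in which the model is written down separately from the routine that performs inference on it, the sketch below defines a joint distribution over three binary variables a, b and c through the directed factorisation p(a) p(b|a) p(c|a,b) and then computes a posterior by brute-force enumeration. The probability tables are made-up numbers and the function names are hypothetical. Enumeration here only stands in for the efficient inference algorithms the abstract refers to, and none of this is Infer.NET code.

```python
from itertools import product

# Hypothetical conditional probability tables for binary variables a, b, c.
P_A = {0: 0.7, 1: 0.3}                       # p(a)
P_B_GIVEN_A = {0: {0: 0.9, 1: 0.1},          # p(b | a): outer key a, inner key b
               1: {0: 0.4, 1: 0.6}}
P_C_GIVEN_AB = {(0, 0): {0: 0.95, 1: 0.05},  # p(c | a, b): key (a, b), inner key c
                (0, 1): {0: 0.5, 1: 0.5},
                (1, 0): {0: 0.3, 1: 0.7},
                (1, 1): {0: 0.1, 1: 0.9}}


def joint(a, b, c):
    """Joint probability from the directed factorisation p(a) p(b|a) p(c|a,b)."""
    return P_A[a] * P_B_GIVEN_A[a][b] * P_C_GIVEN_AB[(a, b)][c]


def posterior_a_given_c(c_obs):
    """Posterior p(a | c = c_obs), marginalising b by exhaustive enumeration."""
    unnorm = {a: sum(joint(a, b, c_obs) for b in (0, 1)) for a in (0, 1)}
    evidence = sum(unnorm.values())
    return {a: p / evidence for a, p in unnorm.items()}


if __name__ == "__main__":
    total = sum(joint(a, b, c) for a, b, c in product((0, 1), repeat=3))
    print(f"joint sums to {total:.6f}")
    print("p(a | c=1) =", posterior_a_given_c(1))
```

Changing the model means editing only the tables or the `joint` function; the inference routine is untouched, which is the separation the model-based approach relies on. Enumeration scales exponentially with the number of variables, so it is suitable only for toy examples like this one.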

Figures

**Figure 1.**
A neural network with two layers of adjustable parameters, in which each parameter corresponds to one of the links in the network. (Online version in colour.)

**Figure 2.**
A directed graphical model representing the joint probability distribution over three variables a, b and c, corresponding to the decomposition on the right-hand side of (4.2). (Online version in colour.)

**Figure 3.**
A directed acyclic graph over seven variables. This graph expresses a decomposition of the joint distribution given by (4.3). (Online version in colour.)

**Figure 4.**
Graphical model representation of a hidden Markov model. This same graph also represents a linear dynamical system. Here, the shaded nodes represent *observed* variables, i.e. ones whose values are fixed by the dataset. (Online version in colour.)

**Figure 5.**
An extension of the model in figure 4 to include auto-regressive dependencies. (Online version in colour.)

**Figure 6.**
An extension of the model in figure 4 to include input variables as well as outputs. (Online version in colour.)

**Figure 7.**
An extension of the model in figure 4 for multiple hidden Markov processes. (Online version in colour.)

**Figure 8.**
A simple Markov chain of variables. (Online version in colour.)

**Figure 9.**
Directed graph showing the TrueSkill model for a single game between two players. See the text for details. (Online version in colour.)

**Figure 10.**
Graph of skill levels for two players in an online game, showing the much faster convergence obtained using TrueSkill compared to the traditional Elo algorithm. (Online version in colour.)

**Figure 11.**
Modified skill rating graph showing the inclusion of three teams A, B and C, in which team B has two players. (Online version in colour.)

**Figure 12.**
Csoft code for the TrueSkill model.

**Figure 13.**
Flow diagram showing the operation of *Infer.NET*. (Online version in colour.)
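To make the TrueSkill captions (figures 9 to 11) more concrete, here is a minimal sketch of a skill update after a single two-player game. It uses the closed-form Gaussian moment-matching update from the published TrueSkill description, assuming no draws and no skill drift between games. It is not the Csoft or Infer.NET code of figure 12. The function name and the default values (mean 25, standard deviation 25/3, performance noise 25/6) are assumptions chosen for illustration.

```python
import math


def _pdf(x):
    """Standard normal density."""
    return math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)


def _cdf(x):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def update_two_player(winner, loser, beta=25.0 / 6.0):
    """Return updated (mean, std) skill beliefs for the winner and the loser.

    Each player is a (mean, std) pair describing a Gaussian belief over skill.
    beta is the assumed standard deviation of per-game performance noise.
    Draws and skill dynamics are deliberately left out of this sketch.
    """
    mu_w, sigma_w = winner
    mu_l, sigma_l = loser

    # Variance of the performance difference between the two players.
    c2 = 2.0 * beta ** 2 + sigma_w ** 2 + sigma_l ** 2
    c = math.sqrt(c2)

    # Normalised skill gap, then the truncated-Gaussian correction terms.
    # For very negative t the cdf underflows; a robust version would guard this.
    t = (mu_w - mu_l) / c
    v = _pdf(t) / _cdf(t)
    w = v * (v + t)

    new_winner = (mu_w + sigma_w ** 2 / c * v,
                  sigma_w * math.sqrt(1.0 - sigma_w ** 2 / c2 * w))
    new_loser = (mu_l - sigma_l ** 2 / c * v,
                 sigma_l * math.sqrt(1.0 - sigma_l ** 2 / c2 * w))
    return new_winner, new_loser


if __name__ == "__main__":
    prior = (25.0, 25.0 / 3.0)
    print(update_two_player(prior, prior))
```

Because each player carries an uncertainty as well as a mean, a surprising result moves an uncertain player's rating a long way and a well-established player's rating very little. This is one intuition for the faster convergence relative to Elo described in the caption of figure 10.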

Cited by

Decoding Drug Discovery: Exploring A-to-Z In Silico Methods for Beginners.
Rasul HO, Ghafour DD, Aziz BK, Hassan BA, Rashid TA, Kivrak A. Appl Biochem Biotechnol. 2025 Mar;197(3):1453-1503. doi: 10.1007/s12010-024-05110-2. Epub 2024 Dec 4. PMID: 39630336. Review.
Machine learning to detect signatures of disease in liquid biopsies - a user's guide.
Ko J, Baldassano SN, Loh PL, Kording K, Litt B, Issadore D. Lab Chip. 2018 Jan 30;18(3):395-405. doi: 10.1039/c7lc00955k. PMID: 29192299. Free PMC article. Review.
Atopic Dermatitis and Respiratory Allergy: What is the Link.
Belgrave DC, Simpson A, Buchan IE, Custovic A. Curr Dermatol Rep. 2015;4(4):221-227. doi: 10.1007/s13671-015-0121-6. Epub 2015 Sep 28. PMID: 26566461. Free PMC article. Review.
Smartphone as a Sensor in mHealth: Narrative Overview, SWOT Analysis, and Proposal of Mobile Biomarkers.
Antonini A, Coşar S, Naja I, Haleem MS, Macdonald JH, Innominato P, Barresi G. Sensors (Basel). 2025 Jun 11;25(12):3655. doi: 10.3390/s25123655. PMID: 40573543. Free PMC article. Review.
Docking-generated multiple ligand poses for bootstrapping bioactivity classifying Machine Learning: Repurposing covalent inhibitors for COVID-19-related TMPRSS2 as case study.
Hatmal MM, Abuyaman O, Taha M. Comput Struct Biotechnol J. 2021;19:4790-4824. doi: 10.1016/j.csbj.2021.08.023. Epub 2021 Aug 19. PMID: 34426763. Free PMC article.
