Learning Bayesian Networks from Correlated Data
- PMID: 27146517
- PMCID: PMC4857179
- DOI: 10.1038/srep25156
Abstract
Bayesian networks are probabilistic models that represent complex distributions in a modular way and have become very popular in many fields. There are many methods to build Bayesian networks from a random sample of independent and identically distributed observations. However, many observational studies use some form of clustered sampling, which introduces correlations between observations within the same cluster, and ignoring this correlation typically inflates the rate of false positive associations. We describe a novel parameterization of Bayesian networks that uses random effects to model the correlation within sample units and can be used for structure and parameter learning from correlated data without inflating the Type I error rate. We compare different learning metrics using simulations and illustrate the method with two real examples: an analysis of genetic and non-genetic factors associated with human longevity from a family-based study, and an analysis of risk factors for complications of sickle cell anemia from a longitudinal study with repeated measures.
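The inflation of false positives under clustered sampling can be illustrated with a small simulation. The sketch below is not the random-effects parameterization described in the abstract. It is a simplified stand-in, under assumed settings (50 clusters of 10 observations, an intraclass correlation of 0.5, and a Fisher z test at the 5% level), that tests two truly independent variables X and Y. The function names `pearson`, `fisher_z_rejects`, and `simulate` are hypothetical and chosen only for this illustration.

```python
import math
import random


def pearson(xs, ys):
    """Sample Pearson correlation of two equal-length sequences."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


def fisher_z_rejects(r, n, z_crit=1.96):
    """Two-sided Fisher z test of zero correlation, treating the n pairs as i.i.d."""
    return abs(math.atanh(r)) * math.sqrt(n - 3) > z_crit


def simulate(n_sims=400, n_clusters=50, cluster_size=10, seed=0):
    """Return (naive, cluster-aware) rejection rates when X and Y are independent.

    Assumed data model: each observation is a cluster effect plus noise, both
    with unit variance, so the intraclass correlation of X and of Y is 0.5.
    The cluster effects of X and Y are drawn independently, so every
    rejection is a false positive.
    """
    rng = random.Random(seed)
    naive_hits = 0
    aware_hits = 0
    for _ in range(n_sims):
        xs, ys, x_means, y_means = [], [], [], []
        for _ in range(n_clusters):
            a = rng.gauss(0, 1)  # cluster effect shared by X within this cluster
            b = rng.gauss(0, 1)  # independent cluster effect shared by Y
            cx = [a + rng.gauss(0, 1) for _ in range(cluster_size)]
            cy = [b + rng.gauss(0, 1) for _ in range(cluster_size)]
            xs.extend(cx)
            ys.extend(cy)
            x_means.append(sum(cx) / cluster_size)
            y_means.append(sum(cy) / cluster_size)
        # Naive analysis: pool all observations and pretend they are i.i.d.
        naive_hits += fisher_z_rejects(pearson(xs, ys), len(xs))
        # Cluster-aware analysis: one pair of means per cluster, so the
        # analysed pairs really are independent across clusters.
        aware_hits += fisher_z_rejects(pearson(x_means, y_means), n_clusters)
    return naive_hits / n_sims, aware_hits / n_sims


if __name__ == "__main__":
    naive_rate, aware_rate = simulate()
    print(f"naive false positive rate:         {naive_rate:.3f}")
    print(f"cluster-aware false positive rate: {aware_rate:.3f}")
```

Under these assumptions the pooled analysis should reject far more often than the nominal 5%, while the analysis of cluster means should stay close to it. Collapsing to cluster means is only the simplest way to respect the clustering; a random-effects model of the kind the abstract describes keeps the individual observations and models the within-cluster correlation directly.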