Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2005 Aug 1;21(15):3273-8.
doi: 10.1093/bioinformatics/bti505. Epub 2005 May 24.

Mining genetic epidemiology data with Bayesian networks I: Bayesian networks and example application (plasma apoE levels)

Affiliations

Mining genetic epidemiology data with Bayesian networks I: Bayesian networks and example application (plasma apoE levels)

Andrei S Rodin et al. Bioinformatics. .

Abstract

Motivation: The wealth of single nucleotide polymorphism (SNP) data within candidate genes and anticipated across the genome poses enormous analytical problems for studies of genotype-to-phenotype relationships, and modern data mining methods may be particularly well suited to meet the swelling challenges. In this paper, we introduce the method of Belief (Bayesian) networks to the domain of genotype-to-phenotype analyses and provide an example application.

Results: A Belief network is a graphical model of a probabilistic nature that represents a joint multivariate probability distribution and reflects conditional independences between variables. Given the data, optimal network topology can be estimated with the assistance of heuristic search algorithms and scoring criteria. Statistical significance of edge strengths can be evaluated using Bayesian methods and bootstrapping. As an example application, the method of Belief networks was applied to 20 SNPs in the apolipoprotein (apo) E gene and plasma apoE levels in a sample of 702 individuals from Jackson, MS. Plasma apoE level was the primary target variable. These analyses indicate that the edge between SNP 4075, coding for the well-known epsilon2 allele, and plasma apoE level was strong. Belief networks can effectively describe complex uncertain processes and can both learn from data and incorporate prior knowledge.

Availability: Various alternative and supplemental networks (not given in the text) as well as source code extensions, are available from the authors.

Supplementary information: http://bioinformatics.oxfordjournals.org.

PubMed Disclaimer

Conflict of interest statement

Conflict of Interest: none declared.

Figures

Fig. 1
Fig. 1
Learned BN relating apolipoprotein E gene SNPs to plasma apoE levels in Jackson, MS. Node legends: numbers refer to corresponding SNPs (see Figure 1 in Nickerson et al. (2000) for an SNP map.) APO_E, APO_A, APO_B, TRIG, CHOL and HDL stand for levels of apolipoproteins E, AI and B, triglycerides, cholesterol and HDL cholesterol, respectively. WEIGHT, GENDER, AGE and HEIGHT labels are self-explanatory. Undirected edges indicate dependencies, directed edges indicate possible causations. Line thickness corresponds to the relative edge strength (Table 1).

References

    1. Akaike, H. (1973) Information theory and an extension of the maximum likelihood principle. In Petrov, B.N. and Csaki, F. (eds), Proceedings of the 2nd International Symposium on Information Theory, Akademiai Kiado, Budapest, Hungary.
    1. Boerwinkle E, Utermann G. Simultaneous effects of the apolipoprotein E polymorphism on apolipoprotein E, apolipoprotein B, and cholesterol metabolism. Am J Hum Genet. 1988;42:104–112. - PMC - PubMed
    1. Boerwinkle E, et al. Apolipoprotein E polymorphism influences postprandial retinyl palmitate but not triglyceride concentrations. Am J Hum Genet. 1994;54:341–360. - PMC - PubMed
    1. Cooper G, Herskovits E. A Bayesian method for the induction of the probabilistic networks from data. Machine Learning. 1992;9:309–347.
    1. Dergunov AD, Rosseneu M. The significance of apolipoprotein E structure to the metabolism of plasma triglyceride-rich lipoproteins. Biol Chem Hoppe Seyler. 1994;375:485–495. - PubMed

Publication types

Substances