Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2011 Oct;44(5):859-68.
doi: 10.1016/j.jbi.2011.05.004. Epub 2011 May 27.

Combining PubMed knowledge and EHR data to develop a weighted bayesian network for pancreatic cancer prediction

Affiliations

Combining PubMed knowledge and EHR data to develop a weighted bayesian network for pancreatic cancer prediction

Di Zhao et al. J Biomed Inform. 2011 Oct.

Abstract

In this paper, we propose a novel method that combines PubMed knowledge and Electronic Health Records to develop a weighted Bayesian Network Inference (BNI) model for pancreatic cancer prediction. We selected 20 common risk factors associated with pancreatic cancer and used PubMed knowledge to weigh the risk factors. A keyword-based algorithm was developed to extract and classify PubMed abstracts into three categories that represented positive, negative, or neutral associations between each risk factor and pancreatic cancer. Then we designed a weighted BNI model by adding the normalized weights into a conventional BNI model. We used this model to extract the EHR values for patients with or without pancreatic cancer, which then enabled us to calculate the prior probabilities for the 20 risk factors in the BNI. The software iDiagnosis was designed to use this weighted BNI model for predicting pancreatic cancer. In an evaluation using a case-control dataset, the weighted BNI model significantly outperformed the conventional BNI and two other classifiers (k-Nearest Neighbor and Support Vector Machine). We conclude that the weighted BNI using PubMed knowledge and EHR data shows remarkable accuracy improvement over existing representative methods for pancreatic cancer prediction.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Steps to construct a node-weighted BNI
Figure 2
Figure 2
Class hierarchy of pancreatic cancer variables
Figure 3
Figure 3
The topology of the Bayesian Network for predicting pancreatic cancer
Figure 4
Figure 4
A simplified Bayesian Network for pancreatic cancer prediction: (a) two-node Bayesian Network (b) three-node Bayesian Network, where Vup is the set of all parent nodes of PC and Vdown is the set of all child nodes of pancreatic cancer.
Figure 5
Figure 5
The interfaces of iDiagnosis for (a) the Bayesian function and (b) the eUtils function.
Figure 6
Figure 6
Comparison of ROC curves of the weighted BNI, the conventional BNI, KNN and SVM: the weighted BNI model is more accurate than the conventional BNI, KNN and SVM for pancreatic cancer prediction.

Similar articles

Cited by

References

    1. Chakraborty S, Baine MJ, Sasson AR, Batra SK. Current status of molecular markers for early detection of sporadic pancreatic cancer. BBA-Rev Cancer. 2011;1815(1):44–64. - PMC - PubMed
    1. Verma M. Pancreatic cancer biomarkers and their implication in cancer diagnosis and epidemiology. Cancers. 2010;2(4):1830–1837. - PMC - PubMed
    1. Gerstung M, Baudis M, Moch H, Beerenwinkel N. Quantifying cancer progression with conjunctive Bayesian Networks. Bioinform. 2009;25(21):2809–2815. - PMC - PubMed
    1. Kim DJ, Rockhill B, Colditz GA. Validation of the Harvard Cancer Risk Index: a prediction tool for individual cancer risk. J Clin Epidemiol. 2004;57(4):332–340. - PubMed
    1. Heckerman D. A tutorial on learning with Bayesian Networks. Microsoft Research Tech Report. 1995:57.

Publication types