Impact of censoring on learning Bayesian networks in survival modelling

Ivan Stajduhar¹, Bojana Dalbelo-Basić, Nikola Bogunović

Affiliations

Affiliation

¹ Department of Automation, Electronics and Computing, Faculty of Engineering, University of Rijeka, Vukovarska 58, 51000 Rijeka, Croatia. ivan.stajduhar@riteh.hr

PMID: 19833488
DOI: 10.1016/j.artmed.2009.08.001

Comparative Study

Impact of censoring on learning Bayesian networks in survival modelling

Ivan Stajduhar et al. Artif Intell Med. 2009 Nov.

. 2009 Nov;47(3):199-217.

doi: 10.1016/j.artmed.2009.08.001. Epub 2009 Oct 14.

Authors

Ivan Stajduhar¹, Bojana Dalbelo-Basić, Nikola Bogunović

Affiliation

¹ Department of Automation, Electronics and Computing, Faculty of Engineering, University of Rijeka, Vukovarska 58, 51000 Rijeka, Croatia. ivan.stajduhar@riteh.hr

PMID: 19833488
DOI: 10.1016/j.artmed.2009.08.001

Abstract

Objective: Bayesian networks are commonly used for presenting uncertainty and covariate interactions in an easily interpretable way. Because of their efficient inference and ability to represent causal relationships, they are an excellent choice for medical decision support systems in diagnosis, treatment, and prognosis. Although good procedures for learning Bayesian networks from data have been defined, their performance in learning from censored survival data has not been widely studied. In this paper, we explore how to use these procedures to learn about possible interactions between prognostic factors and their influence on the variate of interest. We study how censoring affects the probability of learning correct Bayesian network structures. Additionally, we analyse the potential usefulness of the learnt models for predicting the time-independent probability of an event of interest.

Methods and materials: We analysed the influence of censoring with a simulation on synthetic data sampled from randomly generated Bayesian networks. We used two well-known methods for learning Bayesian networks from data: a constraint-based method and a score-based method. We compared the performance of each method under different levels of censoring to those of the naive Bayes classifier and the proportional hazards model. We did additional experiments on several datasets from real-world medical domains. The machine-learning methods treated censored cases in the data as event-free.

Results: We report and compare results for several commonly used model evaluation metrics. On average, the proportional hazards method outperformed other methods in most censoring setups. As part of the simulation study, we also analysed structural similarities of the learnt networks. Heavy censoring, as opposed to no censoring, produces up to a 5% surplus and up to 10% missing total arcs. It also produces up to 50% missing arcs that should originally be connected to the variate of interest.

Conclusion: Presented methods for learning Bayesian networks from data can be used to learn from censored survival data in the presence of light censoring (up to 20%) by treating censored cases as event-free. Given intermediate or heavy censoring, the learnt models become tuned to the majority class and would thus require a different approach.

PubMed Disclaimer

Cited by

A probabilistic analysis of completely excised high-grade soft tissue sarcomas of the extremity: an application of a Bayesian belief network.
Forsberg JA, Healey JH, Brennan MF. Forsberg JA, et al. Ann Surg Oncol. 2012 Sep;19(9):2992-3001. doi: 10.1245/s10434-012-2345-z. Epub 2012 Apr 20. Ann Surg Oncol. 2012. PMID: 22526900 Free PMC article.
Learning rule sets from survival data.
Wróbel Ł, Gudyś A, Sikora M. Wróbel Ł, et al. BMC Bioinformatics. 2017 May 30;18(1):285. doi: 10.1186/s12859-017-1693-x. BMC Bioinformatics. 2017. PMID: 28558674 Free PMC article.
A Naive Bayes machine learning approach to risk prediction using censored, time-to-event data.
Wolfson J, Bandyopadhyay S, Elidrisi M, Vazquez-Benitez G, Vock DM, Musgrove D, Adomavicius G, Johnson PE, O'Connor PJ. Wolfson J, et al. Stat Med. 2015 Sep 20;34(21):2941-57. doi: 10.1002/sim.6526. Epub 2015 May 18. Stat Med. 2015. PMID: 25980520 Free PMC article.
A novel dynamic Bayesian network approach for data mining and survival data analysis.
Sheidaei A, Foroushani AR, Gohari K, Zeraati H. Sheidaei A, et al. BMC Med Inform Decis Mak. 2022 Sep 22;22(1):251. doi: 10.1186/s12911-022-02000-7. BMC Med Inform Decis Mak. 2022. PMID: 36138394 Free PMC article.
Application of machine learning algorithms for clinical predictive modeling: a data-mining approach in SCT.
Shouval R, Bondi O, Mishan H, Shimoni A, Unger R, Nagler A. Shouval R, et al. Bone Marrow Transplant. 2014 Mar;49(3):332-7. doi: 10.1038/bmt.2013.146. Epub 2013 Oct 7. Bone Marrow Transplant. 2014. PMID: 24096823 Review.

See all "Cited by" articles

Publication types

Actions
Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
- ClinicalKey
- Elsevier Science
Research Materials
- NCI CPTC Antibody Characterization Program

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Impact of censoring on learning Bayesian networks in survival modelling

Affiliation

Impact of censoring on learning Bayesian networks in survival modelling

Authors

Affiliation

Abstract

Similar articles

Cited by

Publication types

MeSH terms

LinkOut - more resources

Full Text Sources

Research Materials