A synthetic dataset of liver disorder patients
- PMID: 36747982
- PMCID: PMC9898618
- DOI: 10.1016/j.dib.2023.108921
A synthetic dataset of liver disorder patients
Abstract
The data in this article include 10,000 synthetic patients with liver disorders, characterized by 70 different variables, including clinical features, and patient outcomes, such as hospital admission or surgery. Patient data are generated, simulating as close as possible real patient data, using a publicly available Bayesian network describing a casual model for liver disorders. By varying the network parameters, we also generated an additional set of 500 patients with characteristics that deviated from the initial patient population. We provide an overview of the synthetic data generation process and the associated scripts for generating the cohorts. This dataset can be useful for the machine learning models training and validation, especially under the effect of dataset shift between training and testing sets.
Keywords: Bayesian network; Causal model; Dataset shift; Machine learning; Synthetic patients.
© 2023 The Authors. Published by Elsevier Inc.
Conflict of interest statement
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper. The authors declare the following financial interests/personal relationships which may be considered as potential competing interests: GN is a full employee of enGenome srl.
Figures
References
-
- A. Oniśko, M.J. Druzdzel, H. Wasyluk, A probabilistic causal model for diagnosis of liver disorders, in: Proceedings of the Seventh International Symposium on Intelligent Information Systems (IIS–98), Malbork, Poland, June 15–19, 1998, pp. 379–387
-
- Briganti G., Le Moine O. Artificial intelligence in medicine: today and tomorrow. Front. Med. (Lausanne) 2020;7(27) doi: 10.3389/fmed.2020.00027. https://www.frontiersin.org/articles/10.3389/fmed.2020.00027 Accessed: Oct. 28, 2022. [Online]. Available: PMID: 32118012; PMCID: PMC7012990. - DOI - DOI - PMC - PubMed
LinkOut - more resources
Full Text Sources
