Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Jan 24;9(2):e13200.
doi: 10.1016/j.heliyon.2023.e13200. eCollection 2023 Feb.

Mortality prediction among ICU inpatients based on MIMIC-III database results from the conditional medical generative adversarial network

Affiliations

Mortality prediction among ICU inpatients based on MIMIC-III database results from the conditional medical generative adversarial network

Wei Yang et al. Heliyon. .

Abstract

Background and aims: Improved mortality prediction among intensive care unit (ICU) inpatients is a valuable and challenging task. Limited clinical data, especially with appropriate labels, are an important element restricting accurate predictions. Generative adversarial networks (GANs) are excellent generative models and have shown great potential for data simulation. However, there have been no relevant studies using GANs to predict mortality among ICU inpatients. In this study, we aim to evaluate the predictive performance of a variant of GAN called conditional medical GAN (c-med GAN) compared with some baseline models, including simplified acute physiology score II (SAPS II), support vector machine (SVM), and multilayer perceptron (MLP).

Methods: Data from a publicly available intensive care database, the Medical Information Mart for Intensive Care III (MIMIC-III) database (v1.4), were included in this study. The area under the precision-recall curve (PR-AUC), area under the receiver operating characteristic curve (ROC-AUC), and F1 score were used to evaluate the predictive performance. In addition, the size of the dataset was artificially reduced, and the performance of the c-med GAN was compared in different size datasets.

Results: The results showed that c-med GAN achieves the best PR-AUC, ROC-AUC, and F1 score compared with SAPS II, SVM, and MLP when training in the full MIMIC-III dataset. When the size of the dataset was reduced, the prediction performances of both MLP and c-med GAN were affected. However, the c-med GAN still outperformed MLP on smaller datasets and had less degradation.

Conclusion: The prediction of in-hospital mortality based on the c-med GAN for ICU patients showed better performance than the baseline models. Despite some inadequacies, this model may have a promising future in clinical applications which will be explored by further research.

Keywords: C-med GAN; GAN; MIMIC-III; Mortality prediction; ROC-AUC.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Fig. 1
Fig. 1
Multilayer perceptron.
Fig. 2
Fig. 2
GAN and conditional medical GAN (c-med GAN). (A) Basic GAN. The generator (G) network generates fake data from the random variable Z. The discriminator (D) network distinguishes between fake data and real data. In constant iterations, G is able to generate fake data that even D cannot distinguish from the real data. (B) Autoencoder network in the c-med GAN. The features and labels were extracted separately from the original data and put into the encoder module to produce the intermediate vector V. The intermediate vector V and labels were then put into the decoder module for training. The goal of training is to reproduce the output features identical to the input features. (C) Adversarial networks in the c-med GAN. G first generated variables V', with the same dimensionality as the intermediate variables V in the autoencoder, from the random variable Z. Then, V' was put into the trained decoder module together with the given labels to generate the fake features. Eventually, the given labels and fake features were put into D along with the labels and features of the real data to discriminate their authenticity. The goal of training is to enable G to generate fake data with labels that D cannot distinguish from the real data.
Fig. 3
Fig. 3
The workflow the of pipeline used in this study.
Fig. 4
Fig. 4
Comparison of some basic characters between real and fake data in datasets. The fake data were generated by thec-med GAN. The small and medium datasets included 10% and 50% of patients in the full dataset, respectively.
Fig. 5
Fig. 5
Comparison of the predictive performance between different models. A. Receiver operating characteristic curve, B. precision-recall curve, C. calibration curve.
Fig. 6
Fig. 6
Comparison between the multilayer perceptron and c-med GAN in the different size datasets. A. receiver operating characteristic curve in the full dataset, B. receiver operating characteristic curve in the small dataset, C. receiver operating characteristic curve in the medium dataset. D. precision-recall curve in the full dataset, E. precision-recall curve in the small dataset, F. precision-recall curve in the medium dataset.

Similar articles

Cited by

References

    1. Knaus W.A., Zimmerman J.E., Wagner D.P., Draper E.A., Lawrence D.E. Apache-acute physiology and chronic health evaluation: a physiologically based classification system. Crit. Care Med. 1981;9(8):591–597. - PubMed
    1. Le Gall J.R., Loirat P., Alperovitch A., Glaser P., Granthil C., Mathieu D., Mercier P., Thomas R., Villers D. A simplified acute physiology score for ICU patients. Crit. Care Med. 1984;12(11):975–977. - PubMed
    1. Vincent J.L., Moreno R., Takala J., Willatts S., De Mendonça A., Bruining H., Reinhart C.K., Suter P.M., Thijs L.G. The SOFA (Sepsis-related organ failure assessment) score to describe organ dysfunction/failure. On behalf of the working Group on sepsis-related problems of the European society of intensive care medicine. Intensive Care Med. 1996;22(7):707–710. - PubMed
    1. Knaus W.A., Draper E.A., Wagner D.P., Zimmerman J.E. Apache II: a severity of disease classification system. Crit. Care Med. 1985;13(10):818–829. - PubMed
    1. Poole D., Rossi C., Latronico N., Rossi G., Finazzi S., Bertolini G. Comparison between SAPS II and SAPS 3 in predicting hospital mortality in a cohort of 103 Italian ICUs. Is new always better? Intensive Care Med. 2012;38(8):1280–1288. - PubMed