Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Mar 23;22(3):e16374.
doi: 10.2196/16374.

Learning Latent Space Representations to Predict Patient Outcomes: Model Development and Validation

Affiliations

Learning Latent Space Representations to Predict Patient Outcomes: Model Development and Validation

Subendhu Rongali et al. J Med Internet Res. .

Abstract

Background: Scalable and accurate health outcome prediction using electronic health record (EHR) data has gained much attention in research recently. Previous machine learning models mostly ignore relations between different types of clinical data (ie, laboratory components, International Classification of Diseases codes, and medications).

Objective: This study aimed to model such relations and build predictive models using the EHR data from intensive care units. We developed innovative neural network models and compared them with the widely used logistic regression model and other state-of-the-art neural network models to predict the patient's mortality using their longitudinal EHR data.

Methods: We built a set of neural network models that we collectively called as long short-term memory (LSTM) outcome prediction using comprehensive feature relations or in short, CLOUT. Our CLOUT models use a correlational neural network model to identify a latent space representation between different types of discrete clinical features during a patient's encounter and integrate the latent representation into an LSTM-based predictive model framework. In addition, we designed an ablation experiment to identify risk factors from our CLOUT models. Using physicians' input as the gold standard, we compared the risk factors identified by both CLOUT and logistic regression models.

Results: Experiments on the Medical Information Mart for Intensive Care-III dataset (selected patient population: 7537) show that CLOUT (area under the receiver operating characteristic curve=0.89) has surpassed logistic regression (0.82) and other baseline NN models (<0.86). In addition, physicians' agreement with the CLOUT-derived risk factor rankings was statistically significantly higher than the agreement with the logistic regression model.

Conclusions: Our results support the applicability of CLOUT for real-world clinical use in identifying patients at high risk of mortality.

Keywords: ablation; neural networks; patient mortality; predictive modeling.

PubMed Disclaimer

Conflict of interest statement

Conflicts of Interest: DM has received research grant support from Apple Computer, Bristol-Myers Squibb, Boehringher-Ingelheim, Pfizer, Samsung, Philips Healthcare, Care Evolution, and Biotronik; has received consultancy fees from Bristol-Myers Squibb, Pfizer, Flexcon, and Boston Biomedical Associates; and has inventor equity in Mobile Sense Technologies, Inc, Connecticut.

Figures

Figure 1
Figure 1
Our model architecture. LSTM: long short-term memory.
Figure 2
Figure 2
Model for constructing the encounter vector. ReLU: rectified linear unit; ICD: International Classification of Diseases.
Figure 3
Figure 3
The correlational neural network for our 3 views. ICD: International Classification of Diseases.
Figure 4
Figure 4
The area under the receiver operating characteristic curves for various models. RETAIN: Reverse Time Attention model; CLOUT: L(STM) Outcome prediction using Comprehensive feature relations.

Similar articles

Cited by

References

    1. Kharrazi H, Gonzalez C, Lowe K, Huerta T, Ford E. Forecasting the maturation of electronic health record functions among US hospitals: retrospective analysis and predictive model. J Med Internet Res. 2018;20(8):e10458. doi: 10.2196/preprints.10458. preprint. - DOI - PMC - PubMed
    1. Jha AK, DesRoches CM, Campbell EG, Donelan K, Rao SR, Ferris TG, Shields A, Rosenbaum S, Blumenthal D. Use of electronic health records in US hospitals. N Engl J Med. 2009 Apr 16;360(16):1628–38. doi: 10.1056/NEJMsa0900592. - DOI - PubMed
    1. Choi E, Bahadori MT, Schuetz A, Stewart WF, Sun J. Doctor AI: Predicting clinical events via recurrent neural networks. JMLR Workshop Conf Proc. 2016 Aug;56:301–18. http://europepmc.org/abstract/MED/28286600 - PMC - PubMed
    1. Miotto R, Li L, Kidd BA, Dudley JT. Deep patient: an unsupervised representation to predict the future of patients from the electronic health records. Sci Rep. 2016 May 17;6:26094. doi: 10.1038/srep26094. doi: 10.1038/srep26094. - DOI - DOI - PMC - PubMed
    1. Zhou J, Wang F, Hu J, Ye J. From Micro to Macro: Data Driven Phenotyping by Densification of Longitudinal Electronic Medical Records. Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining; KDD'14; August 24 - 27, 2014; New York, NY, USA. Association for Computing Machinery, New York, NY, United States; 2014. pp. 135–44. - DOI

Publication types

LinkOut - more resources