Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Mar:115:103671.
doi: 10.1016/j.jbi.2020.103671. Epub 2020 Dec 31.

Deep representation learning of patient data from Electronic Health Records (EHR): A systematic review

Affiliations

Deep representation learning of patient data from Electronic Health Records (EHR): A systematic review

Yuqi Si et al. J Biomed Inform. 2021 Mar.

Abstract

Objectives: Patient representation learning refers to learning a dense mathematical representation of a patient that encodes meaningful information from Electronic Health Records (EHRs). This is generally performed using advanced deep learning methods. This study presents a systematic review of this field and provides both qualitative and quantitative analyses from a methodological perspective.

Methods: We identified studies developing patient representations from EHRs with deep learning methods from MEDLINE, EMBASE, Scopus, the Association for Computing Machinery (ACM) Digital Library, and the Institute of Electrical and Electronics Engineers (IEEE) Xplore Digital Library. After screening 363 articles, 49 papers were included for a comprehensive data collection.

Results: Publications developing patient representations almost doubled each year from 2015 until 2019. We noticed a typical workflow starting with feeding raw data, applying deep learning models, and ending with clinical outcome predictions as evaluations of the learned representations. Specifically, learning representations from structured EHR data was dominant (37 out of 49 studies). Recurrent Neural Networks were widely applied as the deep learning architecture (Long short-term memory: 13 studies, Gated recurrent unit: 11 studies). Learning was mainly performed in a supervised manner (30 studies) optimized with cross-entropy loss. Disease prediction was the most common application and evaluation (31 studies). Benchmark datasets were mostly unavailable (28 studies) due to privacy concerns of EHR data, and code availability was assured in 20 studies.

Discussion & conclusion: The existing predictive models mainly focus on the prediction of single diseases, rather than considering the complex mechanisms of patients from a holistic review. We show the importance and feasibility of learning comprehensive representations of patient EHR data through a systematic review. Advances in patient representation learning techniques will be essential for powering patient-level EHR analyses. Future work will still be devoted to leveraging the richness and potential of available EHR data. Reproducibility and transparency of reported results will hopefully improve. Knowledge distillation and advanced learning techniques will be exploited to assist the capability of learning patient representation further.

Keywords: Deep learning; Electronic health records; Patient representation; Systematic review.

PubMed Disclaimer

Conflict of interest statement

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Figures

Fig. 1.
Fig. 1.
Workflow of patient representation learning.
Fig. 2.
Fig. 2.
PRISMA flowchart for including articles in this review. (PRL: Patient Representation Learning. DL: Deep Learning. EHR: Electronic Health Records.)
Fig. 3.
Fig. 3.
Publication counts over time.
Fig. 4.
Fig. 4.
Deep learning architectures used in the included studies.

Similar articles

Cited by

References

    1. Wu J, Roy J, Stewart WF, Prediction modeling using EHR data: Challenges, strategies, and a comparison of machine learning approaches, Med. Care 48 (2010) S106–S113, 10.1097/MLR.0b013e3181de9e17. - DOI - PubMed
    1. Bengio Y, Courville A, Vincent P, Representation learning: A review and new perspectives, IEEE Trans. Pattern Anal. Mach. Intell 35 (2013) 1798–1828, 10.1109/TPAMI.2013.50. - DOI - PubMed
    1. LeCun Y, Bengio Y, Hinton G, Deep learning, Nature 521 (2015) 436–444, 10.1038/nature14539. - DOI - PubMed
    1. Svozil D, Kvasnicka V, Pospichal J, Introduction to multi-layer feed-forward neural networks, Chemomet. Intell. Lab. Syst 39 (1997) 43–62, 10.1016/S0169-7439(97)00061-0. - DOI
    1. Che Z, Kale D, Li W, Bahadori MT, Liu Y, Deep computational phenotyping, in: Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ACM, 2015, pp. 507–516.

Publication types

LinkOut - more resources