Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Aug;20(3):83-87.
doi: 10.1136/eb-2017-102688. Epub 2017 Jul 24.

Applying deep neural networks to unstructured text notes in electronic medical records for phenotyping youth depression

Affiliations

Applying deep neural networks to unstructured text notes in electronic medical records for phenotyping youth depression

Joseph Geraci et al. Evid Based Ment Health. 2017 Aug.

Abstract

Background: We report a study of machine learning applied to the phenotyping of psychiatric diagnosis for research recruitment in youth depression, conducted with 861 labelled electronic medical records (EMRs) documents. A model was built that could accurately identify individuals who were suitable candidates for a study on youth depression.

Objective: Our objective was a model to identify individuals who meet inclusion criteria as well as unsuitable patients who would require exclusion.

Methods: Our methods included applying a system that coded the EMR documents by removing personally identifying information, using two psychiatrists who labelled a set of EMR documents (from which the 861 came), using a brute force search and training a deep neural network for this task.

Findings: According to a cross-validation evaluation, we describe a model that had a specificity of 97% and a sensitivity of 45% and a second model with a specificity of 53% and a sensitivity of 89%. We combined these two models into a third one (sensitivity 93.5%; specificity 68%; positive predictive value (precision) 77%) to generate a list of most suitable candidates in support of research recruitment.

Conclusion: Our efforts are meant to demonstrate the potential for this type of approach for patient recruitment purposes but it should be noted that a larger sample size is required to build a truly reliable recommendation system.

Clinical implications: Future efforts will employ alternate neural network algorithms available and other machine learning methods.

Keywords: deep learning; depression; neural network; phenotyping; youth.

PubMed Disclaimer

Conflict of interest statement

Competing interests: None declared

Figures

Figure 1
Figure 1
The more sensitive DL1 method was initially applied. Following DL1, the more specific DL0 model was then used on the documents selected with DL1. DL, deep learning paradigm.
Figure 2
Figure 2
A typical receiver operating characteristic (ROC) curve for DL0 models derived from a fivefold cross validation. The reason the area under the ROC (AUC) curve is relatively high compared with the AUC for DL1 is because there are a large number of true 0s captured by this model. DL, deep learning paradigm.
Figure 3
Figure 3
A typical receiver operating characteristic (ROC) curve for DL1 models derived from a fivefold cross-validation. The number of true 0s and true 1s in the data set used to train DL1 is balanced and thus the area under the ROC curve is quite poor despite the fact that this model is excellent at predicting true 1s. DL, deep learning paradigm.

References

    1. Carlson RW, Tu SW, Lane NM, et al. . Computer-based screening of patients with HIV/AIDS for clinical-trial eligibility. Online J Curr Clin Trials 1995;Doc No 179:179. - PubMed
    1. Fink E, Kokku PK, Nikiforou S, et al. . Selection of patients for clinical trials: an interactive web-based system. Artif Intell Med 2004;31:241–54.10.1016/j.artmed.2004.01.017 - DOI - PubMed
    1. Kitterman DR, Cheng SK, Dilts DM, et al. . The prevalence and economic impact of low-enrolling clinical studies at an academic medical center. Acad Med 2011;86:1360–6.10.1097/ACM.0b013e3182306440 - DOI - PMC - PubMed
    1. Schmickl CN, Li M, Li G, et al. . The accuracy and efficiency of electronic screening for recruitment into a clinical trial on COPD. Respir Med 2011;105:1501–6.10.1016/j.rmed.2011.04.012 - DOI - PMC - PubMed
    1. Weng C, Batres C, Borda T, et al. . A real-time screening alert improves patient recruitment efficiency. AMIA Annu Symp Proc 2011;2011:1489–98. - PMC - PubMed

Publication types