Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Mar 16;47(2):405-414.
doi: 10.1093/schbul/sbaa126.

Using Natural Language Processing on Electronic Health Records to Enhance Detection and Prediction of Psychosis Risk

Affiliations

Using Natural Language Processing on Electronic Health Records to Enhance Detection and Prediction of Psychosis Risk

Jessica Irving et al. Schizophr Bull. .

Erratum in

Abstract

Background: Using novel data mining methods such as natural language processing (NLP) on electronic health records (EHRs) for screening and detecting individuals at risk for psychosis.

Method: The study included all patients receiving a first index diagnosis of nonorganic and nonpsychotic mental disorder within the South London and Maudsley (SLaM) NHS Foundation Trust between January 1, 2008, and July 28, 2018. Least Absolute Shrinkage and Selection Operator (LASSO)-regularized Cox regression was used to refine and externally validate a refined version of a five-item individualized, transdiagnostic, clinically based risk calculator previously developed (Harrell's C = 0.79) and piloted for implementation. The refined version included 14 additional NLP-predictors: tearfulness, poor appetite, weight loss, insomnia, cannabis, cocaine, guilt, irritability, delusions, hopelessness, disturbed sleep, poor insight, agitation, and paranoia.

Results: A total of 92 151 patients with a first index diagnosis of nonorganic and nonpsychotic mental disorder within the SLaM Trust were included in the derivation (n = 28 297) or external validation (n = 63 854) data sets. Mean age was 33.6 years, 50.7% were women, and 67.0% were of white race/ethnicity. Mean follow-up was 1590 days. The overall 6-year risk of psychosis in secondary mental health care was 3.4 (95% CI, 3.3-3.6). External validation indicated strong performance on unseen data (Harrell's C 0.85, 95% CI 0.84-0.86), an increase of 0.06 from the original model.

Conclusions: Using NLP on EHRs can considerably enhance the prognostic accuracy of psychosis risk calculators. This can help identify patients at risk of psychosis who require assessment and specialized care, facilitating earlier detection and potentially improving patient outcomes.

Keywords: electronic health records; machine learning; natural language processing; prediction; prevention; psychosis.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
Flowchart of study population.

References

    1. GBD 2017 Disease and Injury Incidence and Prevalence Collaborators SL, Abate D, Abate KH, et al. Global, regional, and national incidence, prevalence, and years lived with disability for 354 diseases and injuries for 195 countries and territories, 1990–2017: a systematic analysis for the Global Burden of Disease Study 2017. Lancet (London, England). 2018;392:1789–1858. - PMC - PubMed
    1. Gustavsson A, Svensson M, Jacobi F, et al. Cost of disorders of the brain in Europe 2010. Eur Neuropsychopharmacol. 2011;21(10):718–779. - PubMed
    1. Jääskeläinen E, Juola P, Hirvonen N, et al. A systematic review and meta-analysis of recovery in schizophrenia. Schizophr Bull. 2013;39(6):1296–1306. - PMC - PubMed
    1. Millan MJ, Andrieux A, Bartzokis G, et al. Altering the course of schizophrenia: progress and perspectives. Nat Rev Drug Discov. 2016;15(7):485–515. - PubMed
    1. Fusar-Poli P, Bauer M, Borgwardt S, et al. European college of neuropsychopharmacology network on the prevention of mental disorders and mental health promotion (ECNP PMD-MHP). Eur Neuropsychopharmacol. 2019;29(12):1301–1311. - PubMed

Publication types