Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2020 Dec:226:158-166.
doi: 10.1016/j.schres.2020.04.032. Epub 2020 Jun 1.

Language as a biomarker for psychosis: A natural language processing approach

Affiliations
Review

Language as a biomarker for psychosis: A natural language processing approach

Cheryl M Corcoran et al. Schizophr Res. 2020 Dec.

Abstract

Human ratings of conceptual disorganization, poverty of content, referential cohesion and illogical thinking have been shown to predict psychosis onset in prospective clinical high risk (CHR) cohort studies. The potential value of linguistic biomarkers has been significantly magnified, however, by recent advances in natural language processing (NLP) and machine learning (ML). Such methodologies allow for the rapid and objective measurement of language features, many of which are not easily recognized by human raters. Here we review the key findings on language production disturbance in psychosis. We also describe recent advances in the computational methods used to analyze language data, including methods for the automatic measurement of discourse coherence, syntactic complexity, poverty of content, referential coherence, and metaphorical language. Linguistic biomarkers of psychosis risk are now undergoing cross-validation, with attention to harmonization of methods. Future directions in extended CHR networks include studies of sources of variance, and combination with other promising biomarkers of psychosis risk, such as cognitive and sensory processing impairments likely to be related to language. Implications for the broader study of social communication, including reciprocal prosody, face expression and gesture, are discussed.

Keywords: Automated language analysis; Clinical high risk; Digital phenotyping; Discourse coherence; Latent semantic analysis; Machine learning; Natural language processing; Psychosis; Psychosis risk; Referential coherence; Schizophrenia; Semantic coherence; Semantic density; Ultra high risk.

PubMed Disclaimer

Similar articles

Cited by

References

    1. Addington J, Liu L, Buchy L, Cadenhead KS, Cannon TD, Cornblatt BA, Perkins DO, Seidman LJ, Tsuang MT, Walker EF, Woods SW, Bearden CE, Mathalon DH, McGlashan TH 2015. North American Prodrome Longitudinal Study (NAPLS 2): The prodromal symptoms. J. Nerv. Ment. Dis 203, 328–335. doi:10.1097/NMD.0000000000000290 - DOI - PMC - PubMed
    1. Agurto C, Cecchi GA, Norel R, Ostrand R, Kirkpatrick M, Baggott MJ, Wardle MC, de Wit H, Bedi G. 2020. Neuropsychopharmacology. 45, 8230832. doi.10.1038/s41386-020-0620-4 - DOI - PMC - PubMed
    1. Andreasen NC, 1979. Thought, Language, and Communication Disorders: I. Clinical Assessment, Definition of Terms, and Evaluation of Their Reliability. Arch. Gen. Psychiatry 36, 1315–1321. doi:10.1001/archpsyc.1979.01780120045006 - DOI - PubMed
    1. Andreasen NC, 1979. Thought, Language, and Communication Disorders: II. Diagnostic Significance. Arch. Gen. Psychiatry 36, 1325–1330. doi:10.1001/archpsyc.1979.01780120055007 - DOI - PubMed
    1. Andreasen NC, 1986. Scale for the assessment of thought, language, and communication (TLC). Schizophr. Bull 12, 473–482. doi:10.1093/schbul/12.3.473 - DOI - PubMed

Publication types