Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Jan 15;15(1):e0225695.
doi: 10.1371/journal.pone.0225695. eCollection 2020.

Clinical state tracking in serious mental illness through computational analysis of speech

Affiliations

Clinical state tracking in serious mental illness through computational analysis of speech

Armen C Arevian et al. PLoS One. .

Abstract

Individuals with serious mental illness experience changes in their clinical states over time that are difficult to assess and that result in increased disease burden and care utilization. It is not known if features derived from speech can serve as a transdiagnostic marker of these clinical states. This study evaluates the feasibility of collecting speech samples from people with serious mental illness and explores the potential utility for tracking changes in clinical state over time. Patients (n = 47) were recruited from a community-based mental health clinic with diagnoses of bipolar disorder, major depressive disorder, schizophrenia or schizoaffective disorder. Patients used an interactive voice response system for at least 4 months to provide speech samples. Clinic providers (n = 13) reviewed responses and provided global assessment ratings. We computed features of speech and used machine learning to create models of outcome measures trained using either population data or an individual's own data over time. The system was feasible to use, recording 1101 phone calls and 117 hours of speech. Most (92%) of the patients agreed that it was easy to use. The individually-trained models demonstrated the highest correlation with provider ratings (rho = 0.78, p<0.001). Population-level models demonstrated statistically significant correlations with provider global assessment ratings (rho = 0.44, p<0.001), future provider ratings (rho = 0.33, p<0.05), BASIS-24 summary score, depression sub score, and self-harm sub score (rho = 0.25,0.25, and 0.28 respectively; p<0.05), and the SF-12 mental health sub score (rho = 0.25, p<0.05), but not with other BASIS-24 or SF-12 sub scores. This study brings together longitudinal collection of objective behavioral markers along with a transdiagnostic, personalized approach for tracking of mental health clinical state in a community-based clinical setting.

PubMed Disclaimer

Conflict of interest statement

AA has a financial interest in Insight Health Systems, Inc., Arevian Technologies Inc. AA has a family relationship to Memorial Psychiatric Health Services, a company that provides psychiatric services for the R.O.A.D.S. Foundation clinic where participants were recruited. SN is Chief Scientist at Behavioral Signal Technologies and Lyssn.io. There are no patents, products in development or marketed products to declare. The above interests do not alter our adherence to PLOS ONE policies on sharing data and materials. The specific roles of the authors are articulated in the ‘author contributions’ section.

Figures

Fig 1
Fig 1. Overview of longitudinal assessment and modeling methods.
(A) The MyCoachConnect (MCC) system used to collect speech samples from patients calling into an interactive voice response application. Their providers then used a web application to review speech samples and submit global assessment ratings for each call (B) Comparison of two training methods used. The population-based machine learning model was trained using data from all participants in the study, excluding the test participant. Individualized machine learning model trained on participant’s own data, excluding the test speech sample.
Fig 2
Fig 2. Patient-specific correlation patterns for speech features.
Correlation patterns between speech features and provider global assessment ratings for the top 25 features with the highest average correlation at the population level.
Fig 3
Fig 3. Covariance of speech features and clinical state over time.
(A) An example of clinical state (provider global assessment rating, black line) transitions within an individual patient over time compared to the individual’s highest performing linguistic feature (word count per speech sample, dotted grey line) for each call to the MCC system. (B) Increased correlations between clinical state and speech features over time highlighted through percent of maximal change of 8-period moving averages for provider rating (black line), word count (dotted grey line), and verbal pause percent (dotted light grey line) for the same patient and period.

References

    1. Hedden SL. Behavioral health trends in the United States: results from the 2014 National Survey on Drug Use and Health: Substance Abuse and Mental Health Services Administration, Department of Heath & Human Services; 2015.
    1. Druss BG, Zhao L, Von Esenwein S, Morrato EH, Marcus SC. Understanding excess mortality in persons with mental illness: 17-year follow up of a nationally representative US survey. Medical care. 2011;49(6):599–604. 10.1097/MLR.0b013e31820bf86e - DOI - PubMed
    1. Gore FM, Bloem PJ, Patton GC, Ferguson J, Joseph V, Coffey C, et al. Global burden of disease in young people aged 10–24 years: a systematic analysis. The Lancet. 2011;377(9783):2093–102. - PubMed
    1. Fears SC, Kremeyer B, Araya C, Araya X, Bejarano J, Ramirez M, et al. Multisystem component phenotypes of bipolar disorder for genetic investigations of extended pedigrees. JAMA psychiatry. 2014;71(4):375–87. 10.1001/jamapsychiatry.2013.4100 - DOI - PMC - PubMed
    1. Nestler EJ, Barrot M, DiLeone RJ, Eisch AJ, Gold SJ, Monteggia LM. Neurobiology of depression. Neuron. 2002;34(1):13–25. 10.1016/s0896-6273(02)00653-0 - DOI - PubMed

Publication types