Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Jan 11;6(1):1-7.
doi: 10.5334/cpsy.78. eCollection 2022.

Feasibility Analysis of Phenotype Quantification from Unstructured Clinical Interactions

Affiliations

Feasibility Analysis of Phenotype Quantification from Unstructured Clinical Interactions

Daniel S Barron et al. Comput Psychiatr. .

Abstract

We conducted a feasibility analysis to determine the quality of data that could be collected ambiently during routine clinical conversations. We used inexpensive, consumer-grade hardware to record unstructured dialogue and open-source software tools to quantify and model face, voice (acoustic and language) and movement features. We used an external validation set to perform proof-of-concept predictive analyses and show that clinically relevant measures can be produced without a restrictive protocol.

Keywords: acoustic; conversation; digital phenotype; facial feature; voice.

PubMed Disclaimer

Conflict of interest statement

The authors have no competing interests to declare.

Figures

Quantitative face and voice features versus clinical progress
Figure 1
Quantitative face and voice features versus clinical progress. (A) Face psychomotor activity (gaze and head pose in radians per second scaled from 0 to 1) sized by BPRS depression score. The non-anxious depressed patients (3,5,6,7) tended to have more movement as their depressive mood scores increased and less overall than non-depressed patients. Patient 8 was anxious and depressed, hence her head movement decreased as she recovered. Individual patients are assigned their own color and are numbered by session (Patient 4’s third session is represented by a red dot with the number 3). The background density plot (blue hues) provides context from a larger (142 sessions), independently collected dataset, illustrating how features derived from unstructured conversation fall within the scope of a structured exam. (B) Vowel space density plots visually reveal the trajectory of acoustic changes in a depressed patient who received ketamine infusion. Reduced vowel space is clearly visible during an early session (representing a more monotonic voice, left) compared with a later session (representing a more varied voice, right) for the same patient. The BPRS depressive mood score for the first session was 6 and 0 for the second. Restricted vowel space is a well-documented acoustic feature which has been shown to correlate with depressive mood (Scherer et al., 2016).
Conversational effort and speech content can be measured from unstructured clinical conversation
Figure 2
Conversational effort and speech content can be measured from unstructured clinical conversation. A) Conversational effort illustrates words per session for clinician and patient. The Patient 8 (grey circles) displays an anxious depression phenotype producing markedly more words than the clinician. Participant feature data plotted longitudinally exposes subtle changes in objective measures that, we speculate, the human brain would find difficult to identify from memory. The cross-sectional plot facilitates patient comparison. B) Speech content analysis quantifies diminution of perseveration. Here, we used semantic analysis to calculate the cosine distance between the single perseverating patient’s speech vectors to the GloVe vector for the concept “consulting.” As the patient’s perseveration decreased, this topic became less frequent. No other patients displayed this behavior.

References

    1. Agurto, C., et al. (2019). Analyzing progression of motor and speech impairment in ALS. Conf Proc Annu Int Conf Ieee Eng Medicine Biology Soc Ieee Eng Medicine Biology Soc Annu Conf, 2019, 6097–6102. DOI: 10.1109/EMBC.2019.8857300 - DOI - PubMed
    1. Baltrusaitis, T., Zadeh, A., Lim, Y. C., & Morency, L.-P. (2018). OpenFace 2.0: Facial Behavior Analysis Toolkit. 2018 13th Ieee Int Conf Automatic Face Gesture Recognit Fg 2018, 59–66. DOI: 10.1109/FG.2018.00019 - DOI
    1. Bedi, G., et al. (2015). Automated analysis of free speech predicts psychosis onset in high-risk youths. NPJ schizophrenia, 1, 15030. DOI: 10.1038/npjschz.2015.30 - DOI - PMC - PubMed
    1. Bishay, M., Palasek, P., Priebe, S., & Patras, I. (2019). SchiNet: Automatic Estimation of Symptoms of Schizophrenia from Facial Behaviour Analysis. Ieee T Affect Comput, 1–1.
    1. Cohn, J. F., et al. (2009). Detecting Depression from Facial Actions and Vocal Prosody. 2009 3rd Int Conf Affect Comput Intelligent Interact Work, 1–7. DOI: 10.1109/ACII.2009.5349358 - DOI - PMC - PubMed

LinkOut - more resources