Toward explainable AI (XAI) for mental health detection based on language behavior

Front Psychiatry. 2023 Dec 7:14:1219479. doi: 10.3389/fpsyt.2023.1219479. eCollection 2023.

Elma Kerz¹, Sourabh Zanwar¹, Yu Qiao¹, Daniel Wiechmann²

Affiliations

¹ Department of English and American Studies, RWTH Aachen University, Aachen, North Rhine-Westphalia, Germany.
² Institute for Logic, Language and Computation, University of Amsterdam, Amsterdam, Netherlands.

PMID: 38144474
PMCID: PMC10748510
DOI: 10.3389/fpsyt.2023.1219479

Elma Kerz et al. Front Psychiatry. 2023.

Abstract

Advances in artificial intelligence (AI) in general and Natural Language Processing (NLP) in particular are paving the way for the automated detection and prediction of mental health disorders in the population. Recent research in this area has prioritized predictive accuracy over model interpretability by relying on deep learning methods. However, this prioritization can result in a lack of transparency in the decision-making process, and transparency is critical in sensitive applications such as healthcare. There is thus a growing need for explainable AI (XAI) approaches to psychiatric diagnosis and prediction. The main aim of this work is to address this gap by conducting a systematic investigation of XAI approaches to the automatic detection of mental disorders from language behavior, leveraging textual data from social media. In pursuit of this aim, we perform extensive experiments to evaluate the balance between accuracy and interpretability across predictive mental health models. More specifically, we build BiLSTM models trained on a comprehensive set of human-interpretable features, encompassing syntactic complexity, lexical sophistication, readability, cohesion, stylistics, as well as topics and sentiment/emotions derived from lexicon-based dictionaries, to capture multiple dimensions of language production. We conduct extensive feature ablation experiments to determine the most informative feature groups associated with specific mental health conditions. We juxtapose the performance of these models against a "black-box" domain-specific pretrained transformer adapted for mental health applications. To enhance the interpretability of the transformer models, we utilize a multi-task fusion learning framework infusing information from two relevant domains (emotion and personality traits).
Moreover, we employ two distinct explanation techniques: the local interpretable model-agnostic explanations (LIME) method and a model-specific self-explaining method (AGRAD). These methods allow us to discern the specific categories of words that the information-infused models rely on when generating predictions. Our proposed approaches are evaluated on two public English benchmark datasets, subsuming five mental health conditions (attention-deficit/hyperactivity disorder, anxiety, bipolar disorder, depression and psychological stress).
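
The feature ablation idea mentioned in the abstract can be illustrated with a minimal sketch: remove one feature group at a time and record how much accuracy drops relative to the full feature set. This is not the authors' setup. The paper uses BiLSTM models on real benchmark data, whereas the nearest-centroid classifier, the synthetic data, and the `FEATURE_GROUPS` mapping below are assumptions chosen only to keep the example self-contained.

```python
import random
from statistics import mean

# Hypothetical feature groups: each name maps to column indices of a feature vector.
FEATURE_GROUPS = {
    "syntactic": [0, 1],
    "lexical": [2, 3],
    "sentiment": [4, 5],
}

def make_toy_data(n_per_class=100, seed=0):
    """Synthetic two-class data in which only the 'sentiment' columns carry signal."""
    rng = random.Random(seed)
    data = []
    for label in (0, 1):
        for _ in range(n_per_class):
            row = [rng.gauss(0.0, 1.0) for _ in range(6)]
            row[4] += 3.0 * label  # informative column
            row[5] += 3.0 * label  # informative column
            data.append((row, label))
    rng.shuffle(data)
    return data

def accuracy(train, test, columns):
    """Nearest-centroid accuracy using only the given feature columns."""
    def project(row):
        return [row[c] for c in columns]

    centroids = {}
    for label in (0, 1):
        rows = [project(r) for r, y in train if y == label]
        centroids[label] = [mean(col) for col in zip(*rows)]

    def predict(row):
        p = project(row)
        return min(
            centroids,
            key=lambda y: sum((a - b) ** 2 for a, b in zip(p, centroids[y])),
        )

    return mean([1.0 if predict(r) == y else 0.0 for r, y in test])

def ablation_study(train, test):
    """Return baseline accuracy and the accuracy drop when each group is removed."""
    all_cols = sorted(c for cols in FEATURE_GROUPS.values() for c in cols)
    baseline = accuracy(train, test, all_cols)
    drops = {}
    for group, cols in FEATURE_GROUPS.items():
        kept = [c for c in all_cols if c not in cols]
        drops[group] = baseline - accuracy(train, test, kept)
    return baseline, drops

data = make_toy_data()
train, test = data[:150], data[150:]
baseline, drops = ablation_study(train, test)
for group, drop in sorted(drops.items(), key=lambda kv: -kv[1]):
    print(f"{group:10s} accuracy drop when removed: {drop:+.3f}")
```

In this toy setting the largest drop should appear for the group that was constructed to be informative; with real data, the ranking of drops is what indicates which feature groups matter most for a given condition.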

Keywords: artificial intelligence; automated mental health detection; deep learning; digital NLP-derived biomarkers; digital phenotyping; explainable AI (XAI); machine learning; natural language processing.

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

**Figure 1**
Examples of text contours from five texts representing the mental health conditions investigated in this work. The colored graphs represent the within-text fluctuations of feature values for five selected features, one from each of the five groups of General Linguistic Features (GLF) (Cohesion: Overlap of words across adjacent sentences, Lexical: Mean length of word (in characters), Readability: Flesch-Kincaid Index, Stylistic: Bigram frequency score obtained from the “weblog” register of the Corpus of Contemporary American English, Syntactic: Mean length of sentence). All feature scores are z-standardized and smoothed using B-splines.
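
A text contour of the kind described in this caption can be sketched as a per-sentence feature series that is z-standardized and then smoothed. The sketch below is an illustration under stated assumptions, not the authors' pipeline: the sentence splitter and tokenizer are naive regular expressions, only two of the five features are computed, and a centered moving average stands in for B-spline smoothing. All function names are hypothetical.

```python
import re
from statistics import mean, pstdev

def split_sentences(text):
    """Naive split after ., ! or ? (assumption; a real system would use a tokenizer)."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]

def words(sentence):
    return re.findall(r"[A-Za-z']+", sentence.lower())

def sentence_features(sentence):
    """Two illustrative per-sentence features."""
    toks = words(sentence)
    return {
        "sentence_length": float(len(toks)),
        "mean_word_length": mean([len(t) for t in toks]) if toks else 0.0,
    }

def z_standardize(values):
    """Subtract the mean and divide by the population standard deviation."""
    mu, sigma = mean(values), pstdev(values)
    if sigma == 0:
        return [0.0 for _ in values]
    return [(v - mu) / sigma for v in values]

def moving_average(values, window=3):
    """Centered moving average; a simple stand-in for B-spline smoothing."""
    half = window // 2
    return [
        mean(values[max(0, i - half): i + half + 1])
        for i in range(len(values))
    ]

def text_contours(text, window=3):
    """Map each feature name to its smoothed, z-standardized per-sentence series."""
    feats = [sentence_features(s) for s in split_sentences(text)]
    if not feats:
        return {}
    return {
        name: moving_average(z_standardize([f[name] for f in feats]), window)
        for name in feats[0]
    }

sample = (
    "I slept badly. Today everything feels overwhelming and I cannot "
    "concentrate on anything. It is fine. Tomorrow I will try again."
)
contours = text_contours(sample)
for name, series in contours.items():
    print(name, [round(v, 2) for v in series])
```

Because each series is standardized within the text, the contour shows how a feature rises and falls relative to that text's own average, which is what makes contours from different features comparable on one axis.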

**Figure 2**
Schematic representation of the three model types for mental health detection: **Type 1**: (A) Bidirectional LSTM (BiLSTM) trained on general linguistic features (BiLSTM + GLFs), (B) BiLSTM trained on lexicon-based features (BiLSTM + LBFs) and (C) BiLSTM trained on the combination of GLFs + LBFs; **Type 2**: Pre-trained fine-tuned MentalRoBERTa; **Type 3**: Multi-Task Fusion Models: (A) Emotion-Infused Model, (B) Personality-Infused Model and (C) Emotion-Personality-Infused Model.

**Figure 3**
**Type 1 Models**: BiLSTMs trained on interpretable features.

**Figure 4**
The values of the interpretable features that serve as input to the BiLSTM-based models are extracted by the automatic text analysis (ATA) system. The ATA system distinguishes between general linguistic features and lexicon-based features. General linguistic features tend to result in a dense matrix, where a_ji is the feature score of the jth general linguistic feature for sentence s_i. In contrast, lexicon-based features tend to result in a sparse matrix, which is presented here as a set of 3-tuples (*feature id, sentence id, feature score*). A tuple (*j, i, b*_ji) is included in the set if and only if the jth lexicon-based feature yields a non-zero feature score b_ji for sentence s_i.
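
The two representations described in this caption can be sketched as follows: a dense matrix indexed as `A[j][i]` for general linguistic features, and a set of `(j, i, b_ji)` triples that keeps only non-zero lexicon-based scores. This is a hedged illustration rather than the ATA system itself. The toy `LEXICON`, the two general features, and all function names are assumptions introduced for the example.

```python
import re

# Hypothetical toy lexicon; real lexicon-based features come from curated dictionaries.
LEXICON = {
    0: {"sad", "hopeless", "tired"},     # feature 0: negative-emotion words
    1: {"happy", "glad"},                # feature 1: positive-emotion words
    2: {"worry", "worried", "nervous"},  # feature 2: anxiety words
}

def tokens(sentence):
    return re.findall(r"[a-z']+", sentence.lower())

def dense_general_features(sentences):
    """Dense matrix A with A[j][i] = score of general feature j for sentence s_i."""
    rows = [[], []]
    for s in sentences:
        toks = tokens(s)
        # Feature 0: sentence length in words.
        rows[0].append(float(len(toks)))
        # Feature 1: mean word length in characters.
        rows[1].append(sum(len(t) for t in toks) / len(toks) if toks else 0.0)
    return rows

def sparse_lexicon_features(sentences):
    """Set of (feature id j, sentence id i, score b_ji), keeping non-zero scores only."""
    triples = set()
    for i, s in enumerate(sentences):
        toks = tokens(s)
        for j, vocab in LEXICON.items():
            score = sum(1 for t in toks if t in vocab)
            if score != 0:
                triples.add((j, i, score))
    return triples

sentences = [
    "I feel sad and tired.",
    "The meeting is at noon.",
    "I am worried but glad.",
]
A = dense_general_features(sentences)
B = sparse_lexicon_features(sentences)
print(A)
print(sorted(B))
```

The second sentence contributes a full column to the dense matrix but no triple to the sparse set, since none of its words match a lexicon entry.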

**Figure 5**
**Type 2 Model**: Fine-tuned MentalRoBERTa model.

**Figure 6**
**Type 3 Models**: Multi-task Fusion models.

Cited by

Diagnosis of mental disorders using machine learning: Literature review and bibliometric mapping from 2012 to 2023.
Sharma CM, Chariar VM. Heliyon. 2024 Jun 8;10(12):e32548. doi: 10.1016/j.heliyon.2024.e32548. eCollection 2024 Jun 30. PMID: 38975193. Free PMC article. Review.
Artificial Intelligence in Psychiatry: A Review of Biological and Behavioral Data Analyses.
Baydili İ, Tasci B, Tasci G. Diagnostics (Basel). 2025 Feb 11;15(4):434. doi: 10.3390/diagnostics15040434. PMID: 40002587. Free PMC article. Review.
A CNN-Transformer Fusion Model for Proactive Detection of Schizophrenia Relapse from EEG Signals.
Yasin S, Adeel M, Draz U, Ali T, Hijji M, Ayaz M, Marei AM. Bioengineering (Basel). 2025 Jun 12;12(6):641. doi: 10.3390/bioengineering12060641. PMID: 40564457. Free PMC article.
Choosing the right treatment - combining clinicians' expert knowledge with data-driven predictions.
Maekawa E, Jensen E, van de Ven P, Mathiasen K. Front Psychiatry. 2024 Sep 3;15:1422587. doi: 10.3389/fpsyt.2024.1422587. eCollection 2024. PMID: 39290309. Free PMC article.
From Serendipity to Precision: Integrating AI, Multi-Omics, and Human-Specific Models for Personalized Neuropsychiatric Care.
Tanaka M. Biomedicines. 2025 Jan 12;13(1):167. doi: 10.3390/biomedicines13010167. PMID: 39857751. Free PMC article. Review.
