Review

Front Artif Intell. 2023 Oct 12;6:1229805. doi: 10.3389/frai.2023.1229805. eCollection 2023.

A review of the explainability and safety of conversational agents for mental health to identify avenues for improvement


Surjodeep Sarkar et al. Front Artif Intell. 2023.

Abstract

Virtual Mental Health Assistants (VMHAs) continue to evolve to support the overloaded global healthcare system, which handles approximately 60 million primary care visits and 6 million emergency room visits annually. These systems, developed by clinical psychologists, psychiatrists, and AI researchers, are designed to aid in Cognitive Behavioral Therapy (CBT). The main focus of VMHAs is to provide relevant information to mental health professionals (MHPs) and to engage in meaningful conversations that support individuals with mental health conditions. However, certain gaps prevent VMHAs from fully delivering on this promise during active communication. One such gap is their inability to explain their decisions to patients and MHPs, which makes conversations less trustworthy. Additionally, VMHAs can produce unsafe responses to patient queries, further undermining their reliability. In this review, we assess the current state of VMHAs with respect to user-level explainability and safety, two properties desired for their broader adoption. This includes an examination of ChatGPT, a conversational agent built on the GPT-3.5 and GPT-4 models that has been proposed for use in providing mental health services. By harnessing the collaborative and impactful contributions of the AI, natural language processing, and MHP communities, the review identifies opportunities for technological progress in VMHAs to ensure explainable and safe behavior. It also emphasizes the importance of measures to guarantee that these advancements align with the promise of fostering trustworthy conversations.

Keywords: conversational AI; evaluation metrics; explainable AI; knowledge-infused learning; mental health; safety.


Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

Figure 1
Functional taxonomy of mental health conversations. The blocks with black outlines define the scope of this review, and the dotted red line highlights the growing emphasis on question/response generation in mental health conversations between VMHAs and users with mental health conditions. A high-level discourse analysis demands a focus on user-level explainability and safety, whereas a low-level analysis focuses on achieving clinically grounded active communication. The light gray blocks and text represent prior work and are referred to in the review.
Figure 2
(Left) The results achieved by current VMHAs such as Woebot and Wysa and by general-purpose chatbots such as ChatGPT. (Right) An example of an ideal VMHA: a knowledge-driven conversational agent designed for mental health support. This VMHA uses questions based on the Patient Health Questionnaire-9 (PHQ-9) to facilitate a smooth and meaningful conversation about mental health. By incorporating clinical knowledge, the agent can identify signs of mental disturbance in the user and notify MHPs appropriately.
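As a concrete illustration of the escalation logic described in this caption, the following minimal Python sketch scores PHQ-9 answers and hands off to an MHP when a clinical threshold is crossed. It is not the agent from the review; the notify_mhp() hook, the dialogue strings, and the use of item 9 plus the conventional cutoff of 10 for moderate depression are assumptions made for illustration.

# Illustrative sketch (hypothetical, not the reviewed agent): one PHQ-9-driven
# dialogue turn that scores answers and escalates to an MHP when thresholds are met.

PHQ9_ITEMS = [
    "Little interest or pleasure in doing things",
    "Feeling down, depressed, or hopeless",
    # ... remaining PHQ-9 items (3-8) ...
    "Thoughts that you would be better off dead or of hurting yourself",
]

def notify_mhp(summary: dict) -> None:
    """Hypothetical hand-off hook; a real deployment would route to a clinician."""
    print(f"[MHP notified] {summary}")

def phq9_turn(answers: list[int]) -> str:
    """Score nine PHQ-9 answers (each 0-3) and decide the next conversational move."""
    total = sum(answers)
    self_harm_item = answers[-1]           # item 9 asks about self-harm
    if self_harm_item > 0 or total >= 10:  # >=10 is the usual moderate-depression cutoff
        notify_mhp({"phq9_total": total, "self_harm_item": self_harm_item})
        return "Thank you for sharing. I'm connecting you with a mental health professional."
    return "Thanks. Could you tell me more about how your week has been?"

if __name__ == "__main__":
    print(phq9_turn([1, 2, 0, 1, 0, 1, 0, 0, 0]))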
Figure 3
A conversational scenario in which a user asks a query describing multiple symptoms. (Left) A set of questions generated by repeatedly prompting ChatGPT. (Right) A generation from ALLEVIATE, a knowledge-infused (KI) conversational agent with access to the PHQ-9 and clinical knowledge from the Mayo Clinic.
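The knowledge-infusion step contrasted in this caption can be pictured as prompt construction that grounds the user's symptoms in questionnaire items before any follow-up question is generated. The sketch below is a hypothetical, minimal stand-in and is not ALLEVIATE's implementation; CLINICAL_KB and build_grounded_prompt() are invented names, and the keyword matching is a placeholder for real clinical retrieval.

# Illustrative sketch: compose a knowledge-infused prompt by attaching matching
# PHQ-9 / clinical snippets (toy knowledge base) to the user's query.

CLINICAL_KB = {  # toy stand-in for PHQ-9 and Mayo Clinic knowledge sources
    "sleep": "PHQ-9 item 3: trouble falling or staying asleep, or sleeping too much.",
    "fatigue": "PHQ-9 item 4: feeling tired or having little energy.",
    "appetite": "PHQ-9 item 5: poor appetite or overeating.",
}

def build_grounded_prompt(user_query: str) -> str:
    """Attach matching clinical knowledge so follow-up questions stay clinically grounded."""
    evidence = [text for term, text in CLINICAL_KB.items() if term in user_query.lower()]
    knowledge = "\n".join(evidence) or "No matching questionnaire item found."
    return (
        "Clinical context:\n" + knowledge + "\n\n"
        "User message: " + user_query + "\n"
        "Ask one follow-up question grounded in the clinical context above."
    )

print(build_grounded_prompt("I barely sleep and my appetite is gone."))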
Figure 4
GPT-3.5 provides user-level explainability when prompted with clinically relevant words and keyphrases such as pregnancy, morning sickness, vomiting, nausea, and anxiety caused by tranquilizers during pregnancy. Without these specific keyphrases, GPT-3.5 may produce incorrect inferences [shown in (b)]. When these keyphrases are used as prompts, the explanation provided by GPT-3.5 in (a) becomes more concise than the explanation in (b), which was generated without such prompting. The italicized phrases in (a) represent variations of the words and keyphrases provided during prompting.
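The prompting contrast in Figure 4 can be reproduced in a few lines. The sketch below uses the OpenAI chat completions client for concreteness (an API key is required); the two prompts are illustrative paraphrases of the scenario, not the exact prompts used in the figure.

# Illustrative sketch: the same concern asked without and with clinically
# relevant keyphrases, mirroring panels (b) and (a) of Figure 4.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

def ask(prompt: str) -> str:
    response = client.chat.completions.create(
        model="gpt-3.5-turbo",
        messages=[{"role": "user", "content": prompt}],
    )
    return response.choices[0].message.content

# (b) without keyphrases: vague query, more room for incorrect inference
vague = ask("I feel sick in the mornings and worried about my medication. What is going on?")

# (a) with clinically relevant keyphrases: anchors the model's explanation
keyed = ask(
    "I am pregnant and have morning sickness with vomiting and nausea. "
    "Can anxiety be caused by tranquilizers during pregnancy? Explain your reasoning."
)

print(vague, keyed, sep="\n---\n")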

