Assessing prognosis in depression: comparing perspectives of AI models, mental health professionals and the general public
- PMID: 38199604
- PMCID: PMC10806564
- DOI: 10.1136/fmch-2023-002583
Assessing prognosis in depression: comparing perspectives of AI models, mental health professionals and the general public
Abstract
Background: Artificial intelligence (AI) has rapidly permeated various sectors, including healthcare, highlighting its potential to facilitate mental health assessments. This study explores the underexplored domain of AI's role in evaluating prognosis and long-term outcomes in depressive disorders, offering insights into how AI large language models (LLMs) compare with human perspectives.
Methods: Using case vignettes, we conducted a comparative analysis involving different LLMs (ChatGPT-3.5, ChatGPT-4, Claude and Bard), mental health professionals (general practitioners, psychiatrists, clinical psychologists and mental health nurses), and the general public that reported previously. We evaluate the LLMs ability to generate prognosis, anticipated outcomes with and without professional intervention, and envisioned long-term positive and negative consequences for individuals with depression.
Results: In most of the examined cases, the four LLMs consistently identified depression as the primary diagnosis and recommended a combined treatment of psychotherapy and antidepressant medication. ChatGPT-3.5 exhibited a significantly pessimistic prognosis distinct from other LLMs, professionals and the public. ChatGPT-4, Claude and Bard aligned closely with mental health professionals and the general public perspectives, all of whom anticipated no improvement or worsening without professional help. Regarding long-term outcomes, ChatGPT 3.5, Claude and Bard consistently projected significantly fewer negative long-term consequences of treatment than ChatGPT-4.
Conclusions: This study underscores the potential of AI to complement the expertise of mental health professionals and promote a collaborative paradigm in mental healthcare. The observation that three of the four LLMs closely mirrored the anticipations of mental health experts in scenarios involving treatment underscores the technology's prospective value in offering professional clinical forecasts. The pessimistic outlook presented by ChatGPT 3.5 is concerning, as it could potentially diminish patients' drive to initiate or continue depression therapy. In summary, although LLMs show potential in enhancing healthcare services, their utilisation requires thorough verification and a seamless integration with human judgement and skills.
Keywords: depression; general practice; mental health; nurses; psychiatry.
© Author(s) (or their employer(s)) 2024. Re-use permitted under CC BY-NC. No commercial re-use. See rights and permissions. Published by BMJ.
Conflict of interest statement
Competing interests: None declared.
Figures


Similar articles
-
Comparing the Perspectives of Generative AI, Mental Health Experts, and the General Public on Schizophrenia Recovery: Case Vignette Study.JMIR Ment Health. 2024 Mar 18;11:e53043. doi: 10.2196/53043. JMIR Ment Health. 2024. PMID: 38533615 Free PMC article.
-
PICOT questions and search strategies formulation: A novel approach using artificial intelligence automation.J Nurs Scholarsh. 2025 Jan;57(1):5-16. doi: 10.1111/jnu.13036. Epub 2024 Nov 24. J Nurs Scholarsh. 2025. PMID: 39582233 Free PMC article.
-
Evaluation of the Performance of Generative AI Large Language Models ChatGPT, Google Bard, and Microsoft Bing Chat in Supporting Evidence-Based Dentistry: Comparative Mixed Methods Study.J Med Internet Res. 2023 Dec 28;25:e51580. doi: 10.2196/51580. J Med Internet Res. 2023. PMID: 38009003 Free PMC article.
-
Utility of artificial intelligence-based large language models in ophthalmic care.Ophthalmic Physiol Opt. 2024 May;44(3):641-671. doi: 10.1111/opo.13284. Epub 2024 Feb 25. Ophthalmic Physiol Opt. 2024. PMID: 38404172 Review.
-
The role of large language models in medical image processing: a narrative review.Quant Imaging Med Surg. 2024 Jan 3;14(1):1108-1121. doi: 10.21037/qims-23-892. Epub 2023 Nov 23. Quant Imaging Med Surg. 2024. PMID: 38223123 Free PMC article. Review.
Cited by
-
Large language models outperform general practitioners in identifying complex cases of childhood anxiety.Digit Health. 2024 Dec 15;10:20552076241294182. doi: 10.1177/20552076241294182. eCollection 2024 Jan-Dec. Digit Health. 2024. PMID: 39687523 Free PMC article.
-
The externalization of internal experiences in psychotherapy through generative artificial intelligence: a theoretical, clinical, and ethical analysis.Front Digit Health. 2025 Feb 4;7:1512273. doi: 10.3389/fdgth.2025.1512273. eCollection 2025. Front Digit Health. 2025. PMID: 39968063 Free PMC article.
-
The Application and Ethical Implication of Generative AI in Mental Health: Systematic Review.JMIR Ment Health. 2025 Jun 27;12:e70610. doi: 10.2196/70610. JMIR Ment Health. 2025. PMID: 40577783 Free PMC article. Review.
-
A 25-Year Retrospective of the Use of AI for Diagnosing Acute Stroke: Systematic Review.J Med Internet Res. 2024 Sep 10;26:e59711. doi: 10.2196/59711. J Med Internet Res. 2024. PMID: 39255472 Free PMC article.
-
Effectiveness of generative AI-large language models' recognition of veteran suicide risk: a comparison with human mental health providers using a risk stratification model.Front Psychiatry. 2025 Apr 3;16:1544951. doi: 10.3389/fpsyt.2025.1544951. eCollection 2025. Front Psychiatry. 2025. PMID: 40248601 Free PMC article.
References
-
- Ali O, Abdelbaki W, Shrestha A, et al. . A systematic literature review of artificial intelligence in the healthcare sector: benefits, challenges, methodologies, and functionalities. Journal of Innovation & Knowledge 2023;8:100333. 10.1016/j.jik.2023.100333 - DOI
-
- Mariani MM, Machado I, Nambisan S. Types of innovation and artificial intelligence: a systematic quantitative literature review and research agenda. Journal of Business Research 2023;155:113364. 10.1016/j.jbusres.2022.113364 - DOI
MeSH terms
LinkOut - more resources
Full Text Sources