Comparative Study

JMIR Form Res. 2025 May 30:9:e56057.

doi: 10.2196/56057.

Comparative Efficacy of MultiModal AI Methods in Screening for Major Depressive Disorder: Machine Learning Model Development Predictive Pilot Study

Donghao Chen¹, Pengfei Wang² ³, Xiaolong Zhang² ³, Runqi Qiao¹, Nanxi Li² ³, Xiaodong Zhang² ³, Honggang Zhang¹, Gang Wang² ³

Affiliations

¹ School of Artificial Intelligence, Beijing University of Posts and Telecommunications, Beijing, China.
² Beijing Key Laboratory of Mental Disorders, National Clinical Research Center for Mental Disorders & National Center for Mental Disorders, Beijing Anding Hospital, Capital Medical University, Beijing, China.
³ Advanced Innovation Center for Human Brain Protection, Capital Medical University, Beijing, China.

PMID: 40446148
PMCID: PMC12143584
DOI: 10.2196/56057

Donghao Chen et al. JMIR Form Res. 2025.

Abstract

Background: Conventional approaches for major depressive disorder (MDD) screening rely on two effective but subjective paradigms: self-rated scales and clinical interviews. Artificial intelligence (AI) can potentially contribute to psychiatry, especially through the use of objective data such as audiovisual signals.

Objective: This study aimed to evaluate the efficacy of different paradigms using AI analysis on audiovisual signals.

Methods: We recruited 89 participants (mean age, 37.1 years; male: 30/89, 33.7%; female: 59/89, 66.3%), including 41 patients with MDD and 48 asymptomatic participants. We developed AI models using facial movement, acoustic, and text features extracted from videos obtained via a tool, incorporating four paradigms: conventional scale (CS), question and answering (Q&A), mental imagery description (MID), and video watching (VW). Ablation experiments and 5-fold cross-validation were performed using two AI methods to ascertain the efficacy of paradigm combinations. Attention scores from the deep learning model were calculated and compared with correlation results to assess comprehensibility.
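The ablation and cross-validation design described in Methods can be sketched roughly as follows. This is a minimal illustration, not the authors' pipeline: the function names `paradigm_subsets` and `k_fold_participants` are hypothetical, and it assumes that every non-empty paradigm combination is a candidate and that folds are formed at the participant level. The abstract confirms neither detail.

```python
from itertools import combinations
import random

# The four paradigms named in the abstract.
PARADIGMS = ["CS", "Q&A", "MID", "VW"]

def paradigm_subsets(paradigms):
    """Return every non-empty combination of paradigms, smallest first.
    Assumption: the ablation may consider any subset; the study may have used fewer."""
    return [combo
            for r in range(1, len(paradigms) + 1)
            for combo in combinations(paradigms, r)]

def k_fold_participants(participant_ids, k=5, seed=0):
    """Shuffle participants and deal them into k folds.
    Yields (train_ids, test_ids) once per fold.
    Assumption: splitting by participant, so one person's clips never
    appear in both the training and the test set."""
    ids = list(participant_ids)
    random.Random(seed).shuffle(ids)
    folds = [ids[i::k] for i in range(k)]
    for i in range(k):
        test = folds[i]
        train = [p for j, fold in enumerate(folds) if j != i for p in fold]
        yield train, test

subsets = paradigm_subsets(PARADIGMS)
splits = list(k_fold_participants(range(89), k=5))  # 89 participants, 5 folds
```

Each `(train, test)` pair would then be used to fit and score one model per paradigm subset, which keeps the comparison between combinations on identical data splits.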

Results: In video clip-based analyses, Q&A outperformed MID with a mean binary sensitivity of 79.06% (95% CI 77.06%-83.35%; P=.03) and an effect size of 1.0. Among individuals, the combination of Q&A and MID outperformed MID alone with a mean extent accuracy of 80.00% (95% CI 65.88%-88.24%; P=.01) and an effect size of 0.61. The mean binary accuracy exceeded 76.25% for video clip predictions and 74.12% for individual-level predictions across the two AI methods, with a top individual binary accuracy of 94.12%. The features with high attention scores overlapped substantially with those that were statistically correlated, including 18 features (all Ps<.05), and also aligned with established nonverbal markers.
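Two quantities in these results, individual-level predictions and effect sizes, can be illustrated with a short sketch. Both helpers are hypothetical: the abstract does not say how clip-level predictions were combined per participant (a majority vote is assumed here), nor which effect size measure was reported (Cohen's d with a pooled standard deviation is assumed here). The sample numbers are invented for illustration and are not study data.

```python
from math import sqrt
from statistics import mean, stdev

def aggregate_clips(clip_labels):
    """Majority vote over a participant's binary clip predictions (1 = MDD).
    Assumption: ties resolve to 0 (asymptomatic)."""
    return 1 if sum(clip_labels) * 2 > len(clip_labels) else 0

def cohens_d(a, b):
    """Cohen's d for two independent samples using the pooled standard deviation."""
    n_a, n_b = len(a), len(b)
    pooled_var = ((n_a - 1) * stdev(a) ** 2 + (n_b - 1) * stdev(b) ** 2) / (n_a + n_b - 2)
    return (mean(a) - mean(b)) / sqrt(pooled_var)

# Invented per-fold scores for two paradigms, for illustration only.
scores_paradigm_a = [0.80, 0.78, 0.82, 0.79, 0.81]
scores_paradigm_b = [0.75, 0.74, 0.77, 0.73, 0.76]
d = cohens_d(scores_paradigm_a, scores_paradigm_b)
```

By the common rule of thumb, a d of about 0.5 is a medium effect and about 0.8 or more is a large one, which gives some context for the reported values of 0.61 and 1.0 if they are indeed Cohen's d.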

Conclusions: The Q&A paradigm demonstrated higher efficacy than MID, both individually and in combination. Using AI to analyze audiovisual signals across multiple paradigms has the potential to be an effective tool for MDD screening.

Keywords: MDD; artificial intelligence; computational psychiatry; facial action unit; major depressive disorder; multimodal analysis; multiparadigm analysis.

Conflict of interest statement

Conflicts of Interest: None declared.

Figures

**Figure 1. Components of the Electronic Tool for Depression (ETD). PHQ-9: Patient Health Questionnaire-9; Q&A: question-and-answer.**

**Figure 2. The global feature extraction method architecture. MFCC: mel frequency cepstral coefficients; MLP: multilayer perceptron.**
