Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Jun 23;4(1):26.
doi: 10.1038/s44184-025-00136-8.

Development of the treatment prediction model in the artificial intelligence in depression - medication enhancement study

Affiliations

Development of the treatment prediction model in the artificial intelligence in depression - medication enhancement study

David Benrimoh et al. Npj Ment Health Res. .

Abstract

We introduce an artificial intelligence model to personalize treatment in major depression, which was deployed in the Artificial Intelligence in Depression: Medication Enhancement Study. We predict probabilities of remission across multiple pharmacological treatments, validate model predictions, and examine them for biases. Data from 9042 adults with moderate to severe major depression from antidepressant clinical trials were used to train a deep learning model. On the held-out test-set, the model demonstrated an AUC of 0.65, outperformed a null model (p = 0.01). The model increased population remission rate in hypothetical and actual improvement testing. While the model identified escitalopram as generally outperforming other drugs (consistent with the input data), there was otherwise significant variation in drug rankings. The model did not amplify potentially harmful biases. We demonstrate the first model capable of predicting outcomes for 10 treatments, intended to be used at or near the start of treatment to personalize treatment selection.

PubMed Disclaimer

Conflict of interest statement

Competing interests: D.B., C.A., K.P., R.F., J.M., S.I. were employees and/or shareholders of Aifred Health and supported this research in the context of their work for Aifred Health. DB receives a salary award from the Fonds de recherche du Québec – Santé (FRQS). A.K., S.P. have previously received an honorarium from Aifred Health. J.F.K. has been provided options in Aifred Health. K.H. and G.T. do not report any competing interests. All authors have read and approved the manuscript.

Figures

Fig. 1
Fig. 1
PRISMA diagram for study inclusion and included medications.
Fig. 2
Fig. 2. Test Set Calibration Plots.
Calibration plot for the test set. ECE expected calibration error, MCE maximum calibration error.
Fig. 3
Fig. 3. Bias Plots.
Plots of observed and predicted remission probability for drug, sex, ethnicity, and age groups (extreme right). Error bars represent standard error. A shows result for the training set; B for the validation set; and C for the test set.
Fig. 4
Fig. 4. Selected Sensitivity Plots- Test Set.
Plots of remission rate predicted by the model as each variable (suicidal ideation in (A) and baseline weight loss in (B) at baseline is increased.
Fig. 5
Fig. 5. Example output from the Clinical Decision Support System.
Top: Example result for (A) escitalopram and (B) duloxetine prediction. Information displayed includes drug class, typical dose, effective dose range, minimum/maximum dose range, dosing tips, clinical pearls, patient-specific remission probability for escitalopram, patient-specific remission probability across treatments, and population baseline. [Note: Duloxetine is an SNRI; this was a known error which was intended to be fixed in a software patch to be done after this screenshot was taken]. Bottom: Example interpretability report for (A) Escitalopram and (B) Duloxetine that provides the top 5 most salient variables for each treatment prediction. In this example, a question related to gastrointestinal symptoms is ranked first, which is interesting given that escitalopram is known to have superior tolerability, and the patient is noting significant gastrointestinal symptoms at baseline- a situation in which an SSRI with a more favorable tolerability profile would be preferred. In the second example, for duloxetine, some features are similar (e.g., gastrointestinal sensitivity) but is an interesting difference: hypochondriasis is included, which may be consistent with duloxetine’s known benefit for somatic symptoms.

References

    1. Health Organization, W. Depression and other common mental disorders: global health estimates. (2017).
    1. Greenberg, P. E. et al. The Economic Burden of Adults with Major Depressive Disorder in the United States (2010 and 2018). Pharmacoeconomics39, 653–665 (2021). - PMC - PubMed
    1. Rush, A. J. et al. Acute and longer-term outcomes in depressed outpatients requiring one or several treatment steps: a STAR*D report. Am. J. Psychiatry163, 1905–1917 (2006). - PubMed
    1. Kraus, C., Kadriu, B., Lanzenberger, R., Zarate, C. A. Jr & Kasper, S. Prognosis and improved outcomes in major depression: a review. Transl. Psychiatry9, 127 (2019). - PMC - PubMed
    1. Benrimoh, D. et al. Aifred Health, a Deep Learning Powered Clinical Decision Support System for Mental Health. in The NIPS ’17 Competition: Building Intelligent Systems 251–287 (Springer International Publishing, 2018).

LinkOut - more resources