This is a preprint.

It has not yet been peer reviewed by a journal.

The National Library of Medicine is running a pilot to include preprints that result from research funded by NIH in PMC and PubMed.

[Preprint]. 2025 Oct 17:2025.10.13.25337771.

doi: 10.1101/2025.10.13.25337771.

Machine learning-optimized perinatal depression screening: Maximum impact, minimal burden

Eric Hurwitz, Caroline Shell, Kritika Chugh, Veerle Bergink, Rena C Patel, Crystal Schiller, Melissa A Haendel

PMID: 41282910
PMCID: PMC12633091
DOI: 10.1101/2025.10.13.25337771

Machine learning-optimized perinatal depression screening: Maximum impact, minimal burden

Eric Hurwitz et al. medRxiv. 2025.

[Preprint]. 2025 Oct 17:2025.10.13.25337771.

doi: 10.1101/2025.10.13.25337771.

Authors

Eric Hurwitz, Caroline Shell, Kritika Chugh, Veerle Bergink, Rena C Patel, Crystal Schiller, Melissa A Haendel

PMID: 41282910
PMCID: PMC12633091
DOI: 10.1101/2025.10.13.25337771

Abstract

Introduction: Perinatal depression affects up to 30% of pregnant and postpartum women, which has increased since the COVID-19 pandemic, making rapidly identifying affected women a high clinical priority. While screening tools like the Edinburgh Postnatal Depression Scale (EPDS) are widely used, brevity is important for busy clinical practice to reduce administration time and patient burden. Current methods to shorten assessments rely on traditional psychometric approaches, rather than machine learning (ML) methods that could optimize predictive accuracy.

Methods: We developed an ML framework using National Clinical Cohort Collaborative (N3C) data to predict full 10-item EPDS scores from shortened question subsets (n=22,924). We evaluated all 2-5 item combinations using linear regression, validating performance across multiple cohorts including postpartum women (n=7,750) and external pregnancy populations (n=1,217). For additional validation, we applied our approach to the PHQ-9 (n=398,606) to test generalizability. Binary classification models using clinical thresholds (≥13) determined EPDS screening accuracy. Decision curve analysis was performed to assess the clinical utility of our ML method.

Results: The optimal 2-question EPDS combinations Q4+Q8 (anxiety/sadness) and Q5+Q8 (scared/sadness) both achieved R ² =0.70. Binary classification demonstrated strong performance (sensitivity=0.68-0.72, specificity=0.98-0.99). The framework generalized across postpartum subsets, external pregnancy cohorts, and PHQ-9 validation (R ² =0.64-0.73). Adding covariates did not improve performance. Decision curve analysis showed our ML approach had superior clinical benefit (0.01-0.03) versus traditional additive scoring.

Conclusion/implications: Our ML framework suggests a reduced assessment burden with two EPDS questions maintains predictive accuracy as the full-item EPDS. With ∼3.6 million annual U.S. births, this approach could identify additional positive perinatal depression screens, enhancing screening implementation across clinical settings.

PubMed Disclaimer

Publication types

Actions

LinkOut - more resources

Full Text Sources
- Cold Spring Harbor Laboratory
- PubMed Central

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

This is a preprint.

Machine learning-optimized perinatal depression screening: Maximum impact, minimal burden

Machine learning-optimized perinatal depression screening: Maximum impact, minimal burden

Authors

Abstract

Publication types

LinkOut - more resources

Full Text Sources