J Mood Anxiety Disord. 2025 Jun 19:11:100136.
doi: 10.1016/j.xjmad.2025.100136. eCollection 2025 Sep.

Implications of the choice of method to identify major depressive disorder in large research cohorts

Jorge A Sanchez-Ruiz et al. J Mood Anxiety Disord.

Abstract

Background: Clinical heterogeneity and variations in methods to identify major depressive disorder (MDD) across studies compromise replicability of research findings. This study evaluated potential implications of different MDD case definitions in a large biobank cohort.

Methods: Among Mayo Clinic Biobank participants, MDD was identified using two methods: self-reported MDD in a participant questionnaire (PQ-MDD) and MDD ICD codes in the electronic health record (EHR-MDD). We examined agreement between these definitions and evaluated relationships between case agreement and participant characteristics, including MDD polygenic risk scores (PRS). Finally, we evaluated associations between different MDD case/control definitions and participant characteristics known to be related to MDD.

Results: Among 55,656 participants, 23% were identified as PQ-MDD cases and 17% as EHR-MDD cases, with 85% overall agreement (61% case agreement) between these definitions. Among participants identified as MDD cases by one method, older and male patients, and those with lower measures of morbidity at enrollment, were less likely to be identified as cases by the other method. The strength of the associations between different MDD case/control definitions and participant characteristics varied depending on whether MDD definitions used the same source of information (i.e., EHR only or self-report only), which resulted in stronger associations, or different sources of information (i.e., one from EHR, one from self-report), which resulted in weaker associations.

Conclusion: Our results demonstrate how the methods used to identify patients with a history of MDD can affect sample characteristics and risk factor associations, highlighting the importance of considering phenotype ascertainment in the interpretation of research results.

Keywords: Major depressive disorder; Electronic health records; Genetic risk score; Mental health; Self report.
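
The agreement figures reported in the Results can be illustrated with a small sketch. The abstract does not state how "case agreement" was computed, so the code below shows two common candidates (positive specific agreement and the Jaccard index) purely as assumptions; the function names and the toy data are hypothetical and are not taken from the study.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass
class Crosstab:
    """2x2 cross-tabulation of two binary case definitions."""
    both: int       # case by PQ-MDD and by EHR-MDD
    pq_only: int    # case by PQ-MDD only
    ehr_only: int   # case by EHR-MDD only
    neither: int    # case by neither definition

    @property
    def total(self) -> int:
        return self.both + self.pq_only + self.ehr_only + self.neither


def tabulate(pq: Sequence[bool], ehr: Sequence[bool]) -> Crosstab:
    """Count the four concordance cells from per-participant flags."""
    if len(pq) != len(ehr):
        raise ValueError("pq and ehr must have the same length")
    both = sum(1 for p, e in zip(pq, ehr) if p and e)
    pq_only = sum(1 for p, e in zip(pq, ehr) if p and not e)
    ehr_only = sum(1 for p, e in zip(pq, ehr) if e and not p)
    neither = len(pq) - both - pq_only - ehr_only
    return Crosstab(both, pq_only, ehr_only, neither)


def overall_agreement(t: Crosstab) -> float:
    """Share of participants given the same label by both definitions."""
    return (t.both + t.neither) / t.total if t.total else float("nan")


def positive_agreement(t: Crosstab) -> float:
    """Positive specific agreement: 2a / (2a + b + c).

    Assumption: one possible reading of 'case agreement'.
    """
    denom = 2 * t.both + t.pq_only + t.ehr_only
    return 2 * t.both / denom if denom else float("nan")


def jaccard_agreement(t: Crosstab) -> float:
    """Jaccard index: a / (a + b + c).

    Assumption: another possible reading of 'case agreement'.
    """
    denom = t.both + t.pq_only + t.ehr_only
    return t.both / denom if denom else float("nan")


# Toy flags for ten hypothetical participants (not study data).
toy_pq = [True, True, True, False, False, False, False, False, False, False]
toy_ehr = [True, True, False, True, False, False, False, False, False, False]
toy_table = tabulate(toy_pq, toy_ehr)
print(overall_agreement(toy_table),
      positive_agreement(toy_table),
      jaccard_agreement(toy_table))
```

Overall agreement counts concordant controls as well as concordant cases, so it can remain high even when the two definitions disagree on a large share of cases; the case-focused measures ignore the "neither" cell for that reason.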


Conflict of interest statement

The authors declare the following financial interests/personal relationships which may be considered as potential competing interests: Dr. Weissman has received funding from NIMH and Columbia University Institute for Developmental Sciences, receives book royalties from Perseus Press and Oxford Press, and serves on the editorial board of the Journal of Mood & Anxiety Disorders. None of these represent a conflict of interest. Dr. Mann receives royalties for commercial use of the C-SSRS from the Research Foundation for Mental Hygiene and from Columbia University for the Columbia Pathways App. If there are other authors, they declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Figures

Fig. 1
Factors known to be related to MDD and their association with different MDD case/control definitions. Odds ratios comparing MDD cases vs. controls, across four different case/control definitions. Adjusted for age, gender, and EHR length. Includes two definitions for anxiety disorders, one based on EHR data and one based on self-report (PQ). PRS analyses were limited to European-ancestry participants and additionally adjusted for the first 5 genomic principal components. MDD case/control definitions: PQ-MDD, self-reported being previously diagnosed with depression at any age; EHR-MDD, presence of at least one MDD diagnostic code in EHR up to biobank enrollment; MDD (both), cases with both PQ-MDD and EHR-MDD, controls with neither; MDD (either), cases with either PQ-MDD or EHR-MDD, controls with neither. EHR, electronic health record; ICD, International Classification of Diseases; MDD, major depressive disorder; PHQ-2, 2-item Patient Health Questionnaire; PQ, participant questionnaire; PRS, polygenic risk score.
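
The four case/control definitions in the caption can be derived from two per-participant flags, as in the minimal sketch below. Two points are assumptions rather than statements from the source: that controls for PQ-MDD and EHR-MDD are all participants who are not cases under that method, and that participants who are a case by only one method are excluded from the "MDD (both)" comparison. The function name `classify` is hypothetical.

```python
from typing import Dict, Optional


def classify(pq_mdd: bool, ehr_mdd: bool) -> Dict[str, Optional[bool]]:
    """Return case (True), control (False), or excluded (None) status
    under each of the four definitions named in the caption."""
    neither = not pq_mdd and not ehr_mdd

    if pq_mdd and ehr_mdd:
        both_status: Optional[bool] = True   # case by both methods
    elif neither:
        both_status = False                  # control: case by neither
    else:
        both_status = None                   # assumed excluded (discordant)

    return {
        "PQ-MDD": pq_mdd,                    # assumed: non-cases are controls
        "EHR-MDD": ehr_mdd,                  # assumed: non-cases are controls
        "MDD (both)": both_status,
        "MDD (either)": pq_mdd or ehr_mdd,   # controls are 'neither'
    }


# A participant with a self-reported diagnosis but no EHR code.
print(classify(pq_mdd=True, ehr_mdd=False))
```

Under these assumptions, "MDD (both)" and "MDD (either)" share the same control group and differ only in how discordant participants are handled.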
