Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Meta-Analysis
. 2022 Aug:204:300-311.
doi: 10.1016/j.ymeth.2021.11.005. Epub 2021 Nov 12.

External validation of a shortened screening tool using individual participant data meta-analysis: A case study of the Patient Health Questionnaire-Dep-4

Collaborators, Affiliations
Meta-Analysis

External validation of a shortened screening tool using individual participant data meta-analysis: A case study of the Patient Health Questionnaire-Dep-4

Daphna Harel et al. Methods. 2022 Aug.

Abstract

Shortened versions of self-reported questionnaires may be used to reduce respondent burden. When shortened screening tools are used, it is desirable to maintain equivalent diagnostic accuracy to full-length forms. This manuscript presents a case study that illustrates how external data and individual participant data meta-analysis can be used to assess the equivalence in diagnostic accuracy between a shortened and full-length form. This case study compares the Patient Health Questionnaire-9 (PHQ-9) and a 4-item shortened version (PHQ-Dep-4) that was previously developed using optimal test assembly methods. Using a large database of 75 primary studies (34,698 participants, 3,392 major depression cases), we evaluated whether the PHQ-Dep-4 cutoff of ≥ 4 maintained equivalent diagnostic accuracy to a PHQ-9 cutoff of ≥ 10. Using this external validation dataset, a PHQ-Dep-4 cutoff of ≥ 4 maximized the sum of sensitivity and specificity, with a sensitivity of 0.88 (95% CI 0.81, 0.93), 0.68 (95% CI 0.56, 0.78), and 0.80 (95% CI 0.73, 0.85) for the semi-structured, fully structured, and MINI reference standard categories, respectively, and a specificity of 0.79 (95% CI 0.74, 0.83), 0.85 (95% CI 0.78, 0.90), and 0.83 (95% CI 0.80, 0.86) for the semi-structured, fully structured, and MINI reference standard categories, respectively. While equivalence with a PHQ-9 cutoff of ≥ 10 was not established, we found the sensitivity of the PHQ-Dep-4 to be non-inferior to that of the PHQ-9, and the specificity of the PHQ-Dep-4 to be marginally smaller than the PHQ-9.

Keywords: Equivalence testing; Optimal test assembly; Self-report questionnaire; Sensitivity; Specificity.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
Flow diagram of study selection process.
Fig. 2.
Fig. 2.
Receiver-operating curve for each reference standard category. Points represent cutoffs of 0 (right) to 12 (left) for each reference standard category. X marks the PHQ−9 cutoff of ≥ 10.

References

    1. Goetz C, et al., Item reduction based on rigorous methodological guidelines is necessary to maintain validity when shortening composite measurement scales, J. Clin. Epidemiol 66 (7) (2013) 710–718, 10.1016/J.jclinepi.2012.12.015. - DOI - PubMed
    1. Coste J, Guillemin F, Pouchot J, Fermanian J, Methodological approaches to shortening composite measurement scales, J. Clin. Epidemiol 50 (3) (1997) 247–252, 10.1016/S0895-4356(96)00363-0. - DOI - PubMed
    1. Kruyen PM, Emons WHM, Sijtsma K, On the Shortcomings of Shortened Tests: A Literature Review, Int. J. Test 13 (3) (2013) 223–248, 10.1080/15305058.2012.703734. - DOI
    1. Stanton PC, Sinar JM, Balzer EF, Smith WK, Issues and strategies for reducing the length of self-report scales, Pers. Psychol 55 (1) (2002) 167–194.
    1. Kroenke K, Spitzer RL, The PHQ-9: a new depression diagnostic and severity measure, SLACK Incorporated Thorofare, NJ, 2002.

Publication types

LinkOut - more resources