Quantifying how diagnostic test accuracy depends on threshold in a meta-analysis

Stat Med. 2019 Oct 30;38(24):4789-4803. doi: 10.1002/sim.8301. Epub 2019 Sep 30.

Hayley E Jones¹, Constantine A Gatsonis²,³, Thomas A Trikalinos³, Nicky J Welton¹, A E Ades¹

Affiliations

¹ Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK.
² Department of Biostatistics, Center for Statistical Sciences, Brown University School of Public Health, Providence, Rhode Island.
³ Center for Evidence Synthesis in Health, Brown University School of Public Health, Providence, Rhode Island.

PMID: 31571244
PMCID: PMC6856843
DOI: 10.1002/sim.8301

Hayley E Jones et al. Stat Med. 2019.


Erratum in

Correction: Quantifying how diagnostic test accuracy depends on threshold in a meta-analysis.
Jones HE. Stat Med. 2021 Aug 15;40(18):4166. doi: 10.1002/sim.9103. Epub 2021 Jun 13. PMID: 34120359.

Abstract

Tests for disease often produce a continuous measure, such as the concentration of some biomarker in a blood sample. In clinical practice, a threshold C is selected such that results, say, greater than C are declared positive and those less than C negative. Measures of test accuracy such as sensitivity and specificity depend crucially on C, and the optimal value of this threshold is usually a key question for clinical practice. Standard methods for meta-analysis of test accuracy (i) do not provide summary estimates of accuracy at each threshold, precluding selection of the optimal threshold, and furthermore, (ii) do not make use of all available data. We describe a multinomial meta-analysis model that can take any number of pairs of sensitivity and specificity from each study and explicitly quantifies how accuracy depends on C. Our model assumes that some prespecified or Box-Cox transformation of test results in the diseased and disease-free populations has a logistic distribution. The Box-Cox transformation parameter can be estimated from the data, allowing for a flexible range of underlying distributions. We parameterise in terms of the means and scale parameters of the two logistic distributions. In addition to credible intervals for the pooled sensitivity and specificity across all thresholds, we produce prediction intervals, allowing for between-study heterogeneity in all parameters. We demonstrate the model using two case study meta-analyses, examining the accuracy of tests for acute heart failure and preeclampsia. We show how the model can be extended to explore reasons for heterogeneity using study-level covariates.
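The dependence of accuracy on the threshold C can be illustrated with a minimal sketch, assuming that the transformed test result follows a logistic distribution in each population, as the abstract describes. The function names, the choice of a log transformation (Box-Cox parameter 0), and all parameter values below are hypothetical and chosen only for illustration; they are not estimates or code from the paper.

```python
import math


def box_cox(x, lam):
    """Box-Cox transform of a positive value; lam == 0 gives the natural log."""
    if x <= 0:
        raise ValueError("Box-Cox requires a positive value")
    return math.log(x) if lam == 0 else (x ** lam - 1.0) / lam


def logistic_cdf(z):
    """Standard logistic CDF, written via tanh to avoid overflow."""
    return 0.5 * (1.0 + math.tanh(0.5 * z))


def rate_above(c, mu, scale, lam=0.0):
    """P(result > c) when the Box-Cox-transformed result is logistic(mu, scale)."""
    return 1.0 - logistic_cdf((box_cox(c, lam) - mu) / scale)


def cell_probs(thresholds, mu, scale, lam=0.0):
    """Probability of a result falling in each interval between sorted thresholds."""
    above = [1.0] + [rate_above(c, mu, scale, lam) for c in sorted(thresholds)] + [0.0]
    return [above[i] - above[i + 1] for i in range(len(above) - 1)]


# Illustrative parameters only (assumed, not taken from the paper).
MU_DISEASED, SCALE_DISEASED = math.log(400.0), 1.0
MU_HEALTHY, SCALE_HEALTHY = math.log(50.0), 1.0

# Sensitivity (tpr) and 1 - specificity (fpr) at a threshold of 100.
tpr = rate_above(100.0, MU_DISEASED, SCALE_DISEASED)
fpr = rate_above(100.0, MU_HEALTHY, SCALE_HEALTHY)
print(round(tpr, 3), round(fpr, 3))  # 0.8 0.333
```

Under these assumptions, `cell_probs` gives the interval probabilities that a multinomial likelihood could use when a study reports counts at several thresholds. The abstract does not spell out the exact likelihood construction, so this should be read as one plausible arrangement rather than the authors' implementation.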

Keywords: Box-Cox transformation; ROC curve; evidence synthesis; sensitivity; specificity; test cutoff.

Figures

**Figure 1**
Observed data on the accuracy of Brain Natriuretic Peptide (Triage assay only) in diagnosing acute heart failure across the full observed range of thresholds. Points from the same study are joined. tpr = true positive rate (sensitivity), fpr = false positive rate (1‐specificity). Also shown are point estimates with 95% credible intervals from a series of stratified bivariate meta‐analyses, in which similar thresholds are grouped and analysed together [Colour figure can be viewed at http://wileyonlinelibrary.com]

**Figure 2**
Observed data on the accuracy of spot urinary protein to creatinine ratio in detecting significant proteinuria in suspected preeclampsia. Points from the same study are joined. tpr = true positive rate (sensitivity), fpr = false positive rate (1‐specificity). Also shown are summary point estimates with 95% confidence intervals from an analysis by Riley et al¹⁵ [Colour figure can be viewed at http://wileyonlinelibrary.com]

**Figure 3**
Summary true positive rate (tpr) and false positive rate (fpr) estimates (Models 1‐4) for the Brain natriuretic peptide data across the full range of thresholds. 95% credible intervals and prediction intervals shown are from Model 3 [Colour figure can be viewed at http://wileyonlinelibrary.com]

**Figure 4**
Relationship between average patient age and false positive rate of Brain Natriuretic Peptide (Triage assay) in diagnosing acute heart failure (Model 5 results). Top panel: summary false positive rate across all thresholds for age 60 and age 80. Bottom panel: summary false positive rate at a threshold of 100 ng/litre, by average patient age. Shaded areas are 95% credible intervals [Colour figure can be viewed at http://wileyonlinelibrary.com]

**Figure 5**
Summary true positive rate (tpr) and false positive rate (fpr) estimates (Models 1‐4) for the protein to creatinine ratio data across the full range of thresholds. 95% credible intervals and prediction intervals shown are from Model 3 [Colour figure can be viewed at http://wileyonlinelibrary.com]

Cited by

An empirical comparison of statistical methods for multiple cut-off diagnostic test accuracy meta-analysis of the Edinburgh postnatal depression scale (EPDS) depression screening tool using published results vs individual participant data.
Negeri ZF, Levis B, Ioannidis JPA, Thombs BD, Benedetti A; DEPRESsion Screening Data (DEPRESSD) EPDS Group. BMC Med Res Methodol. 2024 Feb 1;24(1):28. doi: 10.1186/s12874-023-02134-w. PMID: 38302928.

Health Economic Decision Tree Models of Diagnostics for Dummies: A Pictorial Primer.
Rautenberg T, Gerritsen A, Downes M. Diagnostics (Basel). 2020 Mar 14;10(3):158. doi: 10.3390/diagnostics10030158. PMID: 32183372.

TOMAS-R: A template to identify and plan analysis for clinically important variation and multiplicity in diagnostic test accuracy systematic reviews.
Mallett S, Dinnes J, Takwoingi Y, de Ruffano LF. Diagn Progn Res. 2022 Sep 22;6(1):18. doi: 10.1186/s41512-022-00131-z. PMID: 36131330.

Faecal immunochemical tests for patients with symptoms suggestive of colorectal cancer: An updated systematic review and multiple-threshold meta-analysis of diagnostic test accuracy studies.
Harnan S, Hamilton J, Simpson E, Clowes M, Biz AN, Whyte S, Ren S, Cooper K, Abulafi M, Ball A, Benton S, Booth R, Carten R, Edgar S, Hamilton W, Kurien M, Merriman L, Monahan K, Heathcote L, Jones HE, Stevenson M. Colorectal Dis. 2024 Dec 17;27(1):e17255. doi: 10.1111/codi.17255. Online ahead of print. PMID: 39690130. Review.

Diagnostic accuracy of the aortic dissection detection risk score alone or with D-dimer for acute aortic syndromes: Systematic review and meta-analysis.
Ren S, Essat M, Pandor A, Goodacre S, Ren S, Clowes M, Bima P, Toyofuku M, McLatchie R, Bossone E. PLoS One. 2024 Jun 21;19(6):e0304401. doi: 10.1371/journal.pone.0304401. eCollection 2024. PMID: 38905181.

References

1. Reitsma JB, Glas AS, Rutjes AWS, Scholten RJPM, Bossuyt PM, Zwinderman AH. Bivariate analysis of sensitivity and specificity produces informative summary measures in diagnostic reviews. J Clin Epidemiol. 2005;58(10):982‐990.
2. Chu HT, Cole SR. Bivariate meta‐analysis of sensitivity and specificity with sparse data: a generalized linear mixed model approach. J Clin Epidemiol. 2006;59(12):1331‐1332.
3. Rutter CM, Gatsonis CA. A hierarchical regression approach to meta‐analysis of diagnostic test accuracy evaluations. Statist Med. 2001;20(19):2865‐2884.
4. Steinhauser S, Schumacher M, Rücker G. Modelling multiple thresholds in meta‐analysis of diagnostic test accuracy studies. BMC Med Res Methodol. 2016;16(1):97.
5. Macaskill P, Gatsonis C, Deeks J, Harbord R, Takwoingi Y. Chapter 10: analysing and presenting results. In: Deeks JJ, Bossuyt PM, Gatsonis C, eds. Cochrane Handbook for Systematic Reviews of Diagnostic Test Accuracy. London, UK: The Cochrane Collaboration; 2010:1‐59.


