. 2015 Jun;24(6):944-50.

doi: 10.1158/1055-9965.EPI-14-1227. Epub 2015 Apr 2.

Improving the quality of biomarker discovery research: the right samples and enough of them

Margaret S Pepe¹, Christopher I Li², Ziding Feng³

Affiliations

¹ Biostatitistics and Biomathematics Program, Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, Washington. mspepe@u.washington.edu.
² Translational Research Program, Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, Washington.
³ Division of Quantitative Sciences, Department of Biostatistics, The University of Texas MD Anderson Cancer Center, Houston, Texas.

PMID: 25837819
PMCID: PMC4452419
DOI: 10.1158/1055-9965.EPI-14-1227

Improving the quality of biomarker discovery research: the right samples and enough of them

Margaret S Pepe et al. Cancer Epidemiol Biomarkers Prev. 2015 Jun.

. 2015 Jun;24(6):944-50.

doi: 10.1158/1055-9965.EPI-14-1227. Epub 2015 Apr 2.

Authors

Margaret S Pepe¹, Christopher I Li², Ziding Feng³

Affiliations

¹ Biostatitistics and Biomathematics Program, Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, Washington. mspepe@u.washington.edu.
² Translational Research Program, Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, Washington.
³ Division of Quantitative Sciences, Department of Biostatistics, The University of Texas MD Anderson Cancer Center, Houston, Texas.

PMID: 25837819
PMCID: PMC4452419
DOI: 10.1158/1055-9965.EPI-14-1227

Abstract

Background: Biomarker discovery research has yielded few biomarkers that validate for clinical use. A contributing factor may be poor study designs.

Methods: The goal in discovery research is to identify a subset of potentially useful markers from a large set of candidates assayed on case and control samples. We recommend the PRoBE design for selecting samples. We propose sample size calculations that require specifying: (i) a definition for biomarker performance; (ii) the proportion of useful markers the study should identify (Discovery Power); and (iii) the tolerable number of useless markers amongst those identified (False Leads Expected, FLE).

Results: We apply the methodology to a study of 9,000 candidate biomarkers for risk of colon cancer recurrence where a useful biomarker has positive predictive value ≥ 30%. We find that 40 patients with recurrence and 160 without recurrence suffice to filter out 98% of useless markers (2% FLE) while identifying 95% of useful biomarkers (95% Discovery Power). Alternative methods for sample size calculation required more assumptions.

Conclusions: Biomarker discovery research should utilize quality biospecimen repositories and include sample sizes that enable markers meeting prespecified performance characteristics for well-defined clinical applications to be identified.

Impact: The scientific rigor of discovery research should be improved.

PubMed Disclaimer

Figures

**Figure 1**
Key steps for choosing samples in a biomarker discovery study.

**Figure 2**
Key steps for choosing samples in discovering biomarkers for predicting recurrence in stage 1 colon cancer patients.

See this image and copyright information in PMC

Cited by

Statistical inference for net benefit measures in biomarker validation studies.
Marsh TL, Janes H, Pepe MS. Marsh TL, et al. Biometrics. 2020 Sep;76(3):843-852. doi: 10.1111/biom.13190. Epub 2019 Nov 28. Biometrics. 2020. PMID: 31732971 Free PMC article.
The search for drug-targetable diagnostic, prognostic and predictive biomarkers in chronic graft-versus-host disease.
Ren HG, Adom D, Paczesny S. Ren HG, et al. Expert Rev Clin Immunol. 2018 May;14(5):389-404. doi: 10.1080/1744666X.2018.1463159. Epub 2018 Apr 19. Expert Rev Clin Immunol. 2018. PMID: 29629613 Free PMC article. Review.
The Clinical Utility of lncRNAs and Their Application as Molecular Biomarkers in Breast Cancer.
Arriaga-Canon C, Contreras-Espinosa L, Aguilar-Villanueva S, Bargalló-Rocha E, García-Gordillo JA, Cabrera-Galeana P, Castro-Hernández C, Jiménez-Trejo F, Herrera LA. Arriaga-Canon C, et al. Int J Mol Sci. 2023 Apr 18;24(8):7426. doi: 10.3390/ijms24087426. Int J Mol Sci. 2023. PMID: 37108589 Free PMC article. Review.
The Metabolic Signature of In Vitro Produced Bovine Embryos Helps Predict Pregnancy and Birth after Embryo Transfer.
Gimeno I, García-Manrique P, Carrocera S, López-Hidalgo C, Valledor L, Martín-González D, Gómez E. Gimeno I, et al. Metabolites. 2021 Jul 27;11(8):484. doi: 10.3390/metabo11080484. Metabolites. 2021. PMID: 34436426 Free PMC article.
Standard Operating Procedures for Biospecimen Collection, Processing, and Storage: From the Consortium for the Study of Chronic Pancreatitis, Diabetes, and Pancreatic Cancer.
Fisher WE, Cruz-Monserrate Z, McElhany AL, Lesinski GB, Hart PA, Ghosh R, Van Buren G, Fishman DS, Rinaudo JAS, Serrano J, Srivastava S, Mace T, Topazian M, Feng Z, Yadav D, Pandol SJ, Hughes SJ, Liu RY, Lu E, Orr R, Whitcomb DC, Abouhamze AS, Steen H, Sellers ZM, Troendle DM, Uc A, Lowe ME, Conwell DL; Consortium for the Study of Chronic Pancreatitis, Diabetes, and Pancreatic Cancer (CPDPC). Fisher WE, et al. Pancreas. 2018 Nov/Dec;47(10):1213-1221. doi: 10.1097/MPA.0000000000001171. Pancreas. 2018. PMID: 30325860 Free PMC article.

See all "Cited by" articles

References

1. Simon RM, Paik S, Hayes DF. Use of archived specimens in evaluation of prognostic and predictive biomarkers. J Natl Cancer Inst. 2009;101:1446–52. - PMC - PubMed
1. Pepe MS, Etzioni R, Feng Z, Potter JD, Thompson M, Thornquist M, et al. Phases of biomarker development for early detection of cancer. J Natl Cancer Inst. 2001;93:1054–61. - PubMed
1. Pepe MS, Feng Z, Janes H, Bossuyt PM, Potter JD. Pivotal evaluation of the accuracy of a biomarker used for classification or prediction: standards for study design. J Natl Cancer Inst. 2008;100:1432–38. - PMC - PubMed
1. Wallstrom G, Anderson KS, LaBaer J. Biomarker discovery for heterogeneous diseases. Cancer Epidemiol Biomarkers Prev. 2013;22:747–55. - PMC - PubMed
1. Skates SJ, Gillette MA, LaBaer J, Carr SA, Anderson L, Liebler DC, et al. Statistical design for biospecimen cohort size in proteomics-based discovery and verification studies. J Proteome Res. 2013;12:5383–94. - PMC - PubMed

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Improving the quality of biomarker discovery research: the right samples and enough of them

Affiliations

Improving the quality of biomarker discovery research: the right samples and enough of them

Authors

Affiliations

Abstract

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Substances

Grants and funding

LinkOut - more resources

Full Text Sources

Medical

Abstract

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Substances

Related information

Grants and funding

LinkOut - more resources

Full Text Sources

Medical