Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Apr 15;38(8):1303-1320.
doi: 10.1002/sim.8052. Epub 2018 Dec 19.

Design and analysis considerations for combining data from multiple biomarker studies

Affiliations

Design and analysis considerations for combining data from multiple biomarker studies

Abigail Sloan et al. Stat Med. .

Abstract

Pooling data from multiple studies improves estimation of exposure-disease associations through increased sample size. However, biomarker exposure measurements can vary substantially across laboratories and often require calibration to a reference assay prior to pooling. We develop two statistical methods for aggregating biomarker data from multiple studies: the full calibration method and the internalized method. The full calibration method calibrates all biomarker measurements regardless of the availability of reference laboratory measurements while the internalized method calibrates only non-reference laboratory measurements. We compare the performance of these two aggregation methods to two-stage methods. Furthermore, we compare the aggregated and two-stage methods when estimating the calibration curve from controls only or from a random sample of individuals from the study cohort. Our findings include the following: (1) Under random sampling for calibration, exposure effect estimates from the internalized method have a smaller mean squared error than those from the full calibration method. (2) Under the controls-only calibration design, the full calibration method yields effect estimates with the least bias. (3) The two-stage approaches produce average effect estimates that are similar to the full calibration method under a controls only calibration design and the internalized method under a random sample calibration design. We illustrate the methods in an application evaluating the relationship between circulating vitamin D levels and stroke risk in a pooling project of cohort studies.

Keywords: aggregation; between-study variability; calibration; pooling project; two-stage method.

PubMed Disclaimer

Conflict of interest statement

CONFLICT OF INTEREST

None to report.

Figures

FIGURE 1
FIGURE 1
Comparison of percent bias in average calibration parameter estimates over 1000 simulations for a single study when fitting the model E^[Xsi|Wsi]=a^s+b^sWsi, where the true (as, bs) = (3,0.8) and other parameters set at levels as described in Section 3. We consider a common disease with 50% prevalence (panels A and B) and a rare disease with 5% prevalence (panels C and D)
FIGURE 2
FIGURE 2
Comparison of methods as the percentage of participants in the calibration study increases. The sample size of each study remains fixed at 1000 with a calibration study participation rate of 5%, 15%, or 25%. Panels A to F use the controls-only calibration study (COCS) design while panels G to L use the random sample calibration study (RSCS) design

Similar articles

Cited by

  • Spline Analysis of Biomarker Data Pooled from Multiple Matched/Nested Case-Control Studies.
    Wu Y, Gail M, Smith-Warner S, Ziegler R, Wang M. Wu Y, et al. Cancers (Basel). 2022 Jun 3;14(11):2783. doi: 10.3390/cancers14112783. Cancers (Basel). 2022. PMID: 35681763 Free PMC article.
  • Circulating vitamin D and breast cancer risk: an international pooling project of 17 cohorts.
    Visvanathan K, Mondul AM, Zeleniuch-Jacquotte A, Wang M, Gail MH, Yaun SS, Weinstein SJ, McCullough ML, Eliassen AH, Cook NR, Agnoli C, Almquist M, Black A, Buring JE, Chen C, Chen Y, Clendenen T, Dossus L, Fedirko V, Gierach GL, Giovannucci EL, Goodman GE, Goodman MT, Guénel P, Hallmans G, Hankinson SE, Horst RL, Hou T, Huang WY, Jones ME, Joshu CE, Kaaks R, Krogh V, Kühn T, Kvaskoff M, Lee IM, Mahamat-Saleh Y, Malm J, Manjer J, Maskarinec G, Millen AE, Mukhtar TK, Neuhouser ML, Robsahm TE, Schoemaker MJ, Sieri S, Sund M, Swerdlow AJ, Thomson CA, Ursin G, Wactawski-Wende J, Wang Y, Wilkens LR, Wu Y, Zoltick E, Willett WC, Smith-Warner SA, Ziegler RG. Visvanathan K, et al. Eur J Epidemiol. 2023 Jan;38(1):11-29. doi: 10.1007/s10654-022-00921-1. Epub 2023 Jan 3. Eur J Epidemiol. 2023. PMID: 36593337 Free PMC article.
  • Development and validation of a novel combinational index of liquid biopsy biomarker for longitudinal lung cancer patient management.
    Liao MY, Hao YJ, Luo CS, Chen CM, Feng PH, Yang HY, Yao DJ, Lee KY, Tseng FG. Liao MY, et al. J Liq Biopsy. 2024 Sep 10;6:100167. doi: 10.1016/j.jlb.2024.100167. eCollection 2024 Dec. J Liq Biopsy. 2024. PMID: 40027304 Free PMC article.
  • A New Pipeline for the Normalization and Pooling of Metabolomics Data.
    Viallon V, His M, Rinaldi S, Breeur M, Gicquiau A, Hemon B, Overvad K, Tjønneland A, Rostgaard-Hansen AL, Rothwell JA, Lecuyer L, Severi G, Kaaks R, Johnson T, Schulze MB, Palli D, Agnoli C, Panico S, Tumino R, Ricceri F, Verschuren WMM, Engelfriet P, Onland-Moret C, Vermeulen R, Nøst TH, Urbarova I, Zamora-Ros R, Rodriguez-Barranco M, Amiano P, Huerta JM, Ardanaz E, Melander O, Ottoson F, Vidman L, Rentoft M, Schmidt JA, Travis RC, Weiderpass E, Johansson M, Dossus L, Jenab M, Gunter MJ, Lorenzo Bermejo J, Scherer D, Salek RM, Keski-Rahkonen P, Ferrari P. Viallon V, et al. Metabolites. 2021 Sep 17;11(9):631. doi: 10.3390/metabo11090631. Metabolites. 2021. PMID: 34564446 Free PMC article.
  • Statistical methods for biomarker data pooled from multiple nested case-control studies.
    Sloan A, Smith-Warner SA, Ziegler RG, Wang M. Sloan A, et al. Biostatistics. 2021 Jul 17;22(3):541-557. doi: 10.1093/biostatistics/kxz051. Biostatistics. 2021. PMID: 31750898 Free PMC article.

References

    1. Key T, Appleby P, Allen N, Reeves G. Pooling biomarker data from different studies of disease risk, with a focus on endogenous hormones. Cancer Epidemiol Biomark Prev 2010;19(4):960–965. - PMC - PubMed
    1. Smith-Warner S, Spiegelman D, Ritz J, et al. Methods for pooling results of epidemiologic studies the pooling project of prospective studies of diet and cancer. Am J Epidemiol 2006;163(11):1053–1064. - PubMed
    1. Tworoger S, Hankinson S. Use of biomarkers in epidemiologic studies: minimizing the influence of measurement error in the study design and analysis. Cancer Causes Control. 2006;17(7):889–899. - PubMed
    1. McCullough ML, Zoltick ES, Weinstein SJ, et al. Circulating vitamin D and colorectal cancer risk: an international pooling project of 17 cohorts. J Natl Cancer Inst 2018. - PMC - PubMed
    1. Gallicchio L, Helzlsouer K, Chow W, et al. Circulating 25-hydroxyvitamin D and the risk of rarer cancers: design and methods of the cohort consortium vitamin D pooling project of rarer cancers. Am J Epidemiol 2010;172(1):10–20. - PMC - PubMed

Publication types

LinkOut - more resources