. 2018 Mar 13;115(11):2578-2583.

doi: 10.1073/pnas.1708283115. Epub 2018 Mar 12.

Training replicable predictors in multiple studies

Prasad Patil^{1

2}, Giovanni Parmigiani^{3

2}

Affiliations

¹ Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute, Boston, MA 02215.
² Department of Biostatistics, Harvard T. H. Chan School of Public Health, Boston, MA 02115.
³ Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute, Boston, MA 02215; gp@jimmy.harvard.edu.

PMID: 29531060
PMCID: PMC5856504
DOI: 10.1073/pnas.1708283115

Training replicable predictors in multiple studies

Prasad Patil et al. Proc Natl Acad Sci U S A. 2018.

. 2018 Mar 13;115(11):2578-2583.

doi: 10.1073/pnas.1708283115. Epub 2018 Mar 12.

Authors

Prasad Patil^{1

2}, Giovanni Parmigiani^{3

2}

Affiliations

¹ Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute, Boston, MA 02215.
² Department of Biostatistics, Harvard T. H. Chan School of Public Health, Boston, MA 02115.
³ Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute, Boston, MA 02215; gp@jimmy.harvard.edu.

PMID: 29531060
PMCID: PMC5856504
DOI: 10.1073/pnas.1708283115

Abstract

This article considers replicability of the performance of predictors across studies. We suggest a general approach to investigating this issue, based on ensembles of prediction models trained on different studies. We quantify how the common practice of training on a single study accounts in part for the observed challenges in replicability of prediction performance. We also investigate whether ensembles of predictors trained on multiple studies can be combined, using unique criteria, to design robust ensemble learners trained upfront to incorporate replicability into different contexts and populations.

Keywords: cross-study validation; ensemble learning; machine learning; replicability; validation.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

**Fig. 1.**
The architecture of a CSL, illustrated with six studies divided into three subsets, two SSLs, and general weights.

**Fig. 2.**
Ratios of validation rms errors (rmses) to the rmse of the Reg-a weighting strategy, averaged over 100 simulation iterations, as we vary the coefficient perturbation window. *Top* seven panels correspond to different choices of SSL; the colors correspond to different weighting schemes. *Bottom* displays average validation rmse of the best-performing scheme (indicated with color) for each SSL (indicated by letter) at each perturbation window.

**Fig. 3.**
Differential discrimination of alternative classifiers. For each classifier we compute the hazard ratio associated with a change of one unit in the score vector, as evaluated in the validation datasets. The vertical scale is the ratio of this performance measure to that of the Reg-a CSL. Colors indicate classes of learning strategies: White is weighted CSLs with weights addressing cross-study prediction, purple is CSLs with fixed weights, orange is merging and meta-analysis, and blue is a SSL trained on the TCGA dataset. Horizontal lines are at $y = 1$ and at median performance of CS-Avg.

See this image and copyright information in PMC

References

1. Committee on Applied and Theoretical Statistics, Board on Mathematical Sciences and Their Applications, Division on Engineering and Physical Sciences, National Academies of Sciences, Engineering, and Medicine . In: Statistical Challenges in Assessing and Fostering the Reproducibility of Scientific Results, Summary of a Workshop. Schwalbe M, editor. National Academies Press; Washington, DC: 2016. - PubMed
1. Kenett RS, Shmueli G. Clarifying the terminology that describes scientific reproducibility. Nat Methods. 2015;12:699–699. - PubMed
1. Open Source Collaboration et al. Estimating the reproducibility of psychological science. Science. 2015;349:aac4716. - PubMed
1. Heller R, Bogomolov M, Benjamini Y. Deciding whether follow-up studies have replicated findings in a preliminary large-scale omics study. Proc Natl Acad Sci USA. 2014;111:16262–16267. - PMC - PubMed
1. Simon R, Radmacher MD, Dobbin K, McShane LM. Pitfalls in the use of DNA microarray data for diagnostic and prognostic classification. J Natl Cancer Inst. 2003;95:14–18. - PubMed

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Grants and funding

LinkOut - more resources

Full Text Sources
Other Literature Sources
- The Lens - Patent Citations Database
- scite Smart Citations

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Training replicable predictors in multiple studies

Affiliations

Training replicable predictors in multiple studies

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

References

Publication types

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources