The importance of knowing when to stop. A sequential stopping rule for component-wise gradient boosting

Methods Inf Med. 2012;51(2):178-86.

doi: 10.3414/ME11-02-0030. Epub 2012 Feb 20.

A Mayr¹, B Hofner, M Schmid

Affiliation

¹ Institut für Medizininformatik, Biometrie und Epidemiologie, Friedrich-Alexander-Universität Erlangen-Nürnberg, Waldstr. 6, 91054 Erlangen, Germany. Andreas.Mayr@imbe.med.uni-erlangen.de

PMID: 22344292
DOI: 10.3414/ME11-02-0030

A Mayr et al. Methods Inf Med. 2012.

Abstract

Objectives: Component-wise boosting algorithms have evolved into a popular estimation scheme in biomedical regression settings. The number of iterations is the most important tuning parameter for optimizing their performance. To date, no fully automated strategy for determining the optimal stopping iteration of boosting algorithms has been proposed.

Methods: We propose a fully data-driven sequential stopping rule for boosting algorithms. It combines resampling methods with a modified version of an earlier stopping approach that depends on AIC-based information criteria. The new "subsampling after AIC" stopping rule is applied to component-wise gradient boosting algorithms.

Results: The newly developed sequential stopping rule outperformed earlier approaches when applied to both simulated and real data. Specifically, it improved on purely AIC-based methods when used for the microarray-based prediction of the recurrence of metastases in stage II colon cancer patients.

Conclusions: The proposed sequential stopping rule for boosting algorithms can help to identify the optimal stopping iteration during the fitting process itself, at least for the most common loss functions.
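
To make the role of the stopping iteration concrete, the following is a minimal sketch of component-wise L2 boosting in which an AIC-type criterion is tracked along the boosting path. It is not the "subsampling after AIC" rule itself, whose details the abstract does not give. The following are all assumptions made for illustration: the function name `componentwise_l2_boost`, the simple linear base-learners, the squared-error loss, the step length `nu = 0.1`, the corrected-AIC formula, and the simulated data.

```python
import numpy as np

def componentwise_l2_boost(X, y, n_iter=100, nu=0.1):
    """Component-wise L2 boosting with simple linear base-learners.

    Returns the coefficients, the offset, and an AIC-type criterion
    evaluated after every iteration (illustrative assumption, see lead-in).
    """
    n, p = X.shape
    Xc = X - X.mean(axis=0)            # centre each covariate
    col_ss = (Xc ** 2).sum(axis=0)     # squared norm of each centred column
    offset = y.mean()
    fit = np.full(n, offset)
    coef = np.zeros(p)
    B = np.full((n, n), 1.0 / n)       # hat matrix of the offset-only fit
    I = np.eye(n)
    aic = []
    for _ in range(n_iter):
        resid = y - fit
        beta = Xc.T @ resid / col_ss   # least-squares slope per component
        rss = ((resid[:, None] - Xc * beta) ** 2).sum(axis=0)
        j = int(np.argmin(rss))        # best-fitting single component
        coef[j] += nu * beta[j]        # update only that component
        fit += nu * beta[j] * Xc[:, j]
        # Hat matrix update: B_m = B_{m-1} + nu * H_j (I - B_{m-1})
        H_j = np.outer(Xc[:, j], Xc[:, j]) / col_ss[j]
        B = B + nu * H_j @ (I - B)
        df = np.trace(B)               # effective degrees of freedom
        sigma2 = np.mean((y - fit) ** 2)
        # Corrected AIC; requires df + 2 < n
        aic.append(np.log(sigma2) + (1 + df / n) / (1 - (df + 2) / n))
    return coef, offset, np.array(aic)

# Simulated example: 2 informative covariates out of 10
rng = np.random.default_rng(1)
X = rng.normal(size=(80, 10))
y = 2.0 * X[:, 0] - 1.5 * X[:, 3] + rng.normal(scale=0.5, size=80)

coef, offset, aic = componentwise_l2_boost(X, y, n_iter=100, nu=0.1)
m_stop = int(np.argmin(aic)) + 1       # iteration minimising the criterion
print("AIC-type stopping iteration:", m_stop)
```

In this sketch the stopping iteration is read off after the full path has been computed. A resampling-based alternative would instead estimate out-of-sample risk on held-out subsamples for each candidate iteration.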
