Fam Med Community Health. 2020 Feb 16;8(1):e000262.
doi: 10.1136/fmch-2019-000262. eCollection 2020.

Variable selection strategies and its importance in clinical prediction modelling

Mohammad Ziaul Islam Chowdhury et al. Fam Med Community Health.

Abstract

Clinical prediction models are used frequently in clinical practice to identify patients who are at risk of developing an adverse outcome so that preventive measures can be initiated. A prediction model can be developed in a number of ways; however, an appropriate variable selection strategy needs to be followed in all cases. Our purpose is to introduce readers to the concept of variable selection in prediction modelling, including the importance of variable selection and variable reduction strategies. We will discuss the various variable selection techniques that can be applied during prediction model building (backward elimination, forward selection, stepwise selection and all possible subset selection), and the stopping rule/selection criteria in variable selection (p values, Akaike information criterion, Bayesian information criterion and Mallows' Cp statistic). This paper focuses on the importance of including appropriate variables, following the proper steps, and adopting the proper methods when selecting variables for prediction models.
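As a rough illustration of how backward elimination pairs with an information-criterion stopping rule, the sketch below drops one candidate variable at a time for as long as the criterion improves. It is not the authors' procedure. It assumes a continuous outcome fitted by ordinary least squares (clinical prediction models often use logistic regression instead), and the function names, the simulated data and the variable labels `x1` to `x4` are all hypothetical. With `penalty=2` the criterion corresponds to AIC up to an additive constant; `penalty=np.log(n)` would give the BIC analogue.

```python
import numpy as np


def information_criterion(X, y, penalty=2.0):
    """n*log(RSS/n) + penalty*k for a least-squares fit with an intercept.

    penalty=2 gives AIC and penalty=log(n) gives BIC, both up to an
    additive constant, under an assumed Gaussian linear model.
    """
    n = len(y)
    design = np.column_stack([np.ones(n), X])  # prepend intercept column
    beta, *_ = np.linalg.lstsq(design, y, rcond=None)
    rss = float(np.sum((y - design @ beta) ** 2))
    n_params = design.shape[1]
    return n * np.log(rss / n) + penalty * n_params


def backward_elimination(X, y, names, penalty=2.0):
    """Start from the full model; drop variables while the criterion falls."""
    selected = list(range(X.shape[1]))
    best = information_criterion(X[:, selected], y, penalty)
    while selected:
        # Score every model obtained by removing exactly one variable.
        trials = [
            (information_criterion(
                X[:, [j for j in selected if j != i]], y, penalty), i)
            for i in selected
        ]
        trial_score, drop = min(trials)
        if trial_score >= best:  # stopping rule: no removal improves the fit
            break
        best = trial_score
        selected.remove(drop)
    return [names[i] for i in selected], best


# Simulated data (hypothetical): only x1 and x2 truly affect the outcome.
rng = np.random.default_rng(0)
n = 200
X = rng.normal(size=(n, 4))
y = 2.0 * X[:, 0] - 3.0 * X[:, 1] + rng.normal(size=n)
names = ["x1", "x2", "x3", "x4"]

selected, score = backward_elimination(X, y, names)
full_score = information_criterion(X, y)
print("retained:", selected)
```

Forward selection would run the same loop in the opposite direction, starting from the intercept-only model and adding the variable that lowers the criterion most. A p value stopping rule would replace the criterion comparison with a significance test on the candidate variable.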

Keywords: epidemiology.

Conflict of interest statement

Competing interests: None declared.

Figures

Figure 1
Variable selection steps. AIC, Akaike information criterion; BIC, Bayesian information criterion.
