Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Dec 11;17(1):246.
doi: 10.1186/s12909-017-1070-5.

Correcting the predictive validity of a selection test for the effect of indirect range restriction

Affiliations

Correcting the predictive validity of a selection test for the effect of indirect range restriction

Stefan Zimmermann et al. BMC Med Educ. .

Abstract

Background: The validity of selection tests is underestimated if it is determined by simply calculating the predictor-outcome correlation found in the admitted group. This correlation is usually attenuated by two factors: (1) the combination of selection variables which can compensate for each other and (2) range restriction in predictor and outcome due to the absence of outcome measures for rejected applicants.

Methods: Here we demonstrate the logic of these artifacts in a situation typical for student selection tests and compare four different methods for their correction: two formulas for the correction of direct and indirect range restriction, expectation maximization algorithm (EM) and multiple imputation by chained equations (MICE). First we show with simulated data how a realistic estimation of predictive validity could be achieved; second we apply the same methods to empirical data from one medical school.

Results: The results of the four methods are very similar except for the direct range restriction formula which underestimated validity.

Conclusion: For practical purposes Thorndike's case C formula is a relatively straightforward solution to the range restriction problem, provided distributional assumptions are met. With EM and MICE more precision is obtained when distributional requirements are not met, but access to a sophisticated statistical package such as R is needed. The use of true score correlation has its own problems and does not seem to provide a better correction than other methods.

Keywords: EM; Mice; Predictive validity; Range restriction; Student selection; Suppression.

PubMed Disclaimer

Conflict of interest statement

Ethics approval and consent to participate

The ethic commission board (Ethik-Kommission der Ärztekammer Hamburg, PV4983) has approved that our admission research in general does not constitute research with human subjects (“kein Forschungsvorhaben am Menschen”) in a clinical sense. The present study is part of a larger project that has been approved by the dean of the Hamburg medical faculty (statement of ethical considerations development of admission procedures at Hamburg Medical School). We obtained written informed consent from our participants.

Consent for publication

Not applicable

Competing interest

The authors declare that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Figures

Fig. 1
Fig. 1
a Scattergram of X1 and X2. 20% of 1000 applicants are selected by the sum of X1 and X2; the circular cloud representing all applicants is divided by a diagonal line that separates the top right area from the bottom left area. b This generates a negative correlation between X1 and X2 in the incumbents rx1x2i=0.71. Residuals of X1 after the linear effect of X2 is removed. They are expressed as deviations from the regression line: The residuum of X1 when the influence of X2 is removed is the observed X1 value minus the expected value of the regression X1 on X2
Fig. 2
Fig. 2
Relations between X1, X2, and Y in applicants and incumbents for 20% selection rate. ryx1: first order correlation, βyx1: beta coefficient, ryx12: semipartial correlation
Fig. 3
Fig. 3
The correlation in the full sample of applicants (a) is larger than the correlation in the incumbents (b) due to range restriction: The variances of X1, X2 and Y are restricted
Fig. 4
Fig. 4
a-c Scattergram of Y (study success) with X1 (test results) and precision of the estimation of ryx1a (predictive validity) from different methods when 30%, 20%, and 10% of applicants are selected
Fig. 5
Fig. 5
Scattergram of HAM-Nat and hGPA at Hamburg medical school for the 2011 cohort
Fig. 6
Fig. 6
Relations between HAM-Nat, hGPA, and Study Success in the incumbents. ryx1: first order correlation, βyx1: beta coefficient, ryx12: semipartial correlation

Similar articles

Cited by

References

    1. Sackett PR, Yang H. Correction for range restriction: an expanded typology. J Appl Psych. 2000;85(1):112–118. doi: 10.1037/0021-9010.85.1.112. - DOI - PubMed
    1. Venables WN, Ripley BD. Modern applied statistics with S: springer. 2002.
    1. McManus I, Dewberry C, Nicholson S, Dowell JS, Woolf K, Potts HWW. Construct-level predictive validity of educational attainment and intellectual aptitude tests in medical student selection: meta-regression of six UK longitudinal studies. BMC Med. 2013;11:243. doi: 10.1186/1741-7015-11-243. - DOI - PMC - PubMed
    1. Friedman L, Wall M. Graphical views of suppression and multicollinearity in multiple linear regression. Am Stat. 2005;59(2):127–136. doi: 10.1198/000313005X41337. - DOI
    1. Lutz JG A method for constructing data which illustrate three types of suppressor variables. Educ Psych Meas. 1983;43(2):373–77.

Publication types

MeSH terms

LinkOut - more resources