Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Mar 28;11(3):e0152330.
doi: 10.1371/journal.pone.0152330. eCollection 2016.

A Missing Data Approach to Correct for Direct and Indirect Range Restrictions with a Dichotomous Criterion: A Simulation Study

Affiliations

A Missing Data Approach to Correct for Direct and Indirect Range Restrictions with a Dichotomous Criterion: A Simulation Study

Andreas Pfaffel et al. PLoS One. .

Abstract

A recurring methodological problem in the evaluation of the predictive validity of selection methods is that the values of the criterion variable are available for selected applicants only. This so-called range restriction problem causes biased population estimates. Correction methods for direct and indirect range restriction scenarios have widely studied for continuous criterion variables but not for dichotomous ones. The few existing approaches are inapplicable because they do not consider the unknown base rate of success. Hence, there is a lack of scientific research on suitable correction methods and the systematic analysis of their accuracies in the cases of a naturally or artificially dichotomous criterion. We aim to overcome this deficiency by viewing the range restriction problem as a missing data mechanism. We used multiple imputation by chained equations to generate complete criterion data before estimating the predictive validity and the base rate of success. Monte Carlo simulations were conducted to investigate the accuracy of the proposed correction in dependence of selection ratio, predictive validity, and base rate of success in an experimental design. In addition, we compared our proposed missing data approach with Thorndike's well-known correction formulas that have only been used in the case of continuous criterion variables so far. The results show that the missing data approach is more accurate in estimating the predictive validity than Thorndike's correction formulas. The accuracy of our proposed correction increases as the selection ratio and the correlation between predictor and criterion increase. Furthermore, the missing data approach provides a valid estimate of the unknown base rate of success. On the basis of our findings, we argue for the use of multiple imputation by chained equations in the evaluation of the predictive validity of selection methods when the criterion is dichotomous.

PubMed Disclaimer

Conflict of interest statement

Competing Interests: The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. Missing data patterns under range restriction.
(a) Direct range restriction scenario (selection on X), and (b) indirect range restriction scenario (selection on Z). The shaded areas in Y represent the location of the missing values in the dataset.
Fig 2
Fig 2. Direct range restriction (DRR): Root mean square error (RMSE) of the estimates of the predictive validity (r^b and r^pb).
r^b is the estimate of the biserial correlation coefficient for an artificially dichotomous criterion variable, and r^pb is the estimate of the point-biserial correlation coefficient for a naturally dichotomous criterion variable.
Fig 3
Fig 3. Indirect range restriction (IRR): Root mean square error (RMSE) of the estimates of the predictive validity (r^b and r^pb).
r^b is the estimate of the biserial correlation coefficient for an artificially dichotomous criterion variable, and r^pb is the estimate of the point-biserial correlation coefficient for a naturally dichotomous criterion variable.
Fig 4
Fig 4. Direct range restriction (DRR): Effects of a weak, moderate, and strong predictive validity on the root mean square error (RMSE) of the estimates of the predictive validity (r^b and r^pb).
r^b is the estimate of the biserial correlation coefficient for an artificially dichotomous criterion variable, and r^pb is the estimate of the point-biserial correlation coefficient for a naturally dichotomous criterion variable.
Fig 5
Fig 5. Indirect range restriction (IRR): Effects of a weak, moderate, and strong predictive validity on the root mean square error (RMSE) of the estimates of the predictive validity (r^b and r^pb).
r^b is the estimate of the biserial correlation coefficient for an artificially dichotomous criterion variable, and r^pb is the estimate of the point-biserial correlation coefficient for a naturally dichotomous criterion variable.
Fig 6
Fig 6. Indirect range restriction (IRR): Effects of a weak, moderate, and strong relationship between predictor X and selection variable Z on the root mean square error (RMSE) of the estimates of the predictive validity (r^b and r^pb).
r^b is the estimate of the biserial correlation coefficient for an artificially dichotomous criterion variable, and r^pb is the estimate of the point-biserial correlation coefficient for a naturally dichotomous criterion variable.

References

    1. Alexander RA, Barrett GV, Alliger GM, Carson KP. Towards a general model of non-random sampling and the impact on population correlation: Generalizations of Berkson’s Fallacy and restriction of range. Br J Math Stat Psychol. 1986;39: 90–105. 10.1111/j.2044-8317.1986.tb00849.x - DOI - PubMed
    1. Berk RA. An introduction to sample selection bias in sociological data. Am Sociol Rev. 1983;48: 386–398. 10.2307/2095230 - DOI
    1. Gross AL, McGanney ML. The restriction of range problem and nonignorable selection processes. J Appl Psychol. 1987;72: 604–610. 10.1037/0021-9010.72.4.604 - DOI
    1. Sackett PR, Yang H. Correction for range restriction: An expanded typology. J Appl Psychol. 2000;85: 112–118. 10.1037/0021-9010.85.1.112 - DOI - PubMed
    1. Thorndike RL. Personnel selection: Test and measurement techniques. New York: Wiley; 1949.

Publication types

LinkOut - more resources