Enhancing measurement validity in diverse populations: Modern approaches to evaluating differential item functioning
- PMID: 37431154
- DOI: 10.1111/bmsp.12316
Enhancing measurement validity in diverse populations: Modern approaches to evaluating differential item functioning
Abstract
When developing and evaluating psychometric measures, a key concern is to ensure that they accurately capture individual differences on the intended construct across the entire population of interest. Inaccurate assessments of individual differences can occur when responses to some items reflect not only the intended construct but also construct-irrelevant characteristics, like a person's race or sex. Unaccounted for, this item bias can lead to apparent differences on the scores that do not reflect true differences, invalidating comparisons between people with different backgrounds. Accordingly, empirically identifying which items manifest bias through the evaluation of differential item functioning (DIF) has been a longstanding focus of much psychometric research. The majority of this work has focused on evaluating DIF across two (or a few) groups. Modern conceptualizations of identity, however, emphasize its multi-determined and intersectional nature, with some aspects better represented as dimensional than categorical. Fortunately, many model-based approaches to modelling DIF now exist that allow for simultaneous evaluation of multiple background variables, including both continuous and categorical variables, and potential interactions among background variables. This paper provides a comparative, integrative review of these new approaches to modelling DIF and clarifies both the opportunities and challenges associated with their application in psychometric research.
Keywords: differential item functioning; factor analysis; item response theory; measurement; psychometrics.
© 2023 British Psychological Society.
Similar articles
-
Modern psychometric methods for detection of differential item functioning: application to cognitive assessment measures.Stat Med. 2000 Jun 15-30;19(11-12):1651-83. doi: 10.1002/(sici)1097-0258(20000615/30)19:11/12<1651::aid-sim453>3.0.co;2-h. Stat Med. 2000. PMID: 10844726
-
Are medical school preclinical tests biased for sex and race? A differential item functioning analysis.BMC Med Educ. 2025 Jan 29;25(1):146. doi: 10.1186/s12909-024-06540-6. BMC Med Educ. 2025. PMID: 39881271 Free PMC article.
-
Demographic and functional differences among social security disability claimants.Qual Life Res. 2021 Jun;30(6):1757-1768. doi: 10.1007/s11136-021-02765-w. Epub 2021 Feb 21. Qual Life Res. 2021. PMID: 33611754
-
Differential item functioning on the Mini-Mental State Examination. An application of the Mantel-Haenszel and standardization procedures.Med Care. 2006 Nov;44(11 Suppl 3):S107-14. doi: 10.1097/01.mlr.0000245182.36914.4a. Med Care. 2006. PMID: 17060817 Review.
-
Identification of differential item functioning using item response theory and the likelihood-based model comparison approach. Application to the Mini-Mental State Examination.Med Care. 2006 Nov;44(11 Suppl 3):S134-42. doi: 10.1097/01.mlr.0000245251.83359.8c. Med Care. 2006. PMID: 17060820 Review.
Cited by
-
A Generalized Multi-Detector Combination Approach for Differential Item Functioning Detection.Appl Psychol Meas. 2024 Dec 19:01466216241310602. doi: 10.1177/01466216241310602. Online ahead of print. Appl Psychol Meas. 2024. PMID: 39713763 Free PMC article.
-
Using moderated nonlinear factor models to adjust for differential item functioning in the Student-Teacher Relationship Scale from kindergarten to Grade 6.J Sch Psychol. 2024 Aug;105:101324. doi: 10.1016/j.jsp.2024.101324. Epub 2024 May 25. J Sch Psychol. 2024. PMID: 38876547 Free PMC article.
-
Characterizing the hierarchical depression phenotype in sexually diverse individuals.J Psychiatr Res. 2024 May;173:157-162. doi: 10.1016/j.jpsychires.2024.03.005. Epub 2024 Mar 11. J Psychiatr Res. 2024. PMID: 38531146 Free PMC article.
-
Enhancing Precision in Predicting Magnitude of Differential Item Functioning: An M-DIF Pretrained Model Approach.Educ Psychol Meas. 2024 Oct 1:00131644241279882. doi: 10.1177/00131644241279882. Online ahead of print. Educ Psychol Meas. 2024. PMID: 39554774 Free PMC article.
References
REFERENCES
-
- American Educational Research Association, American Psychological Association, National Council on Measurement in Education, & Joint Committee on Standards for Educational and Psychological Testing (U.S.). (2014). Standards for educational and psychological testing. American Educational Research Association.
-
- Bauer, D. J., Belzak, W. C. M., & Cole, V. T. (2020). Simplifying the assessment of measurement invariance over multiple background variables: Using regularized moderated nonlinear factor analysis to detect differential item functioning. Structural Equation Modeling: A Multidisciplinary Journal, 27, 43-55. https://doi.org/10.1080/10705511.2019.1642754
-
- Barendse, M. T., Oort, F. J., & Garst, G. J. A. (2010). Using restricted factor analysis with latent moderated structures to detect uniform and nonuniform measurement bias: A simulation study. AStA Advances in Statistical Analysis, 94, 117-127. https://doi.org/10.1007/s10182-010-0126-1
-
- Bauer, D. J. (2017). A more general model for testing measurement invariance and differential item functioning. Psychological Methods, 22, 507-526. https://doi.org/10.1037/met0000077
-
- Bauer, D. J., & Hussong, A. M. (2009). Psychometric approaches for developing commensurate measures across independent studies: Traditional and new models. Psychological Methods, 14, 101-125.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources